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Abstract 



This work deals with two problems arising in mathematical ecology. The first problem 
is concerned with diploid branching particle models and its behavior when rapid stirring 
is added to the interaction. The particle models involve two types of particles, male and 
female, and branching can only occur when both types of particles are present. We show 
that if the branching rate is sufficiently large, this particle model has a nontrivial stationary 
distribution, i.e. one that does not concentrate all weight on the all-0 state, using a compari- 
son argument due to R. Durrett. We also show extinction for small branching rates, thereby 
establishing the existence of a phase transition. We then add two different rapid stirring 
mechanisms to the interactions and show that for the particle models with rapid stirring, 
there also exist nontrivial stationary distribution(s); for this, we analyze the limiting PDE 
and establish a condition on the PDE that guarantees existence of nontrivial stationary 
distributions for sufficient fast stirring. 

The second problem deals with a model of sympatric speciation, i.e. speciation in the 
absence of geographical separation, originally proposed by U. Dicckmann and M. Docbeli in 
1999. We modify their original model to obtain several constant-population particle models. 
We concentrate on a continuous-time model that converges to a deterministic dynamical 
system as the number of particles becomes large. We establish various results regarding 
whether speciation occurs by studying the existence of bimodal stationary distributions for 
the limiting dynamical system. 
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Chapter 1 

Introduction and Overview 



This work consists of two parts, each of which involves a class of models arising from a 
problem of mathematical ecology. In the first problem, we study diploid branching particle 
system models. This class of particle systems differs from the "usual" models that one 
normally finds in the literature, in that there are two types of particles, modelling the male 
and female populations, and branching (i.e. birth of new particles) requires the presence of 
both male and female particles. In the second problem, we study various particle models 
that are all related to a sympatric speciation model proposed in [Dicckmann and Doebeli 
1999]. In both particle models, we are mainly concerned with the equilibrium behaviour. 
More specifically, we show that the stationary distributions of the particle models have 
desirable properties, e.g. nontriviality (i.e. does not concentrate all weight on the all-0 
state) in the first problem and bimodality in the second problem. 
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Part I 

Existence of Nontrivial 
Stationary Distribution for the 
Diploid Branching Particle 
System with Rapid Stirring 
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Chapter 2 

The Particle Models 



In this part of our work, we consider a type of particle systems that can be used to model 
sexual reproduction of a certain species. This work was inspired in part by [Dawson and 
Perkins 1998]. In that paper, the following system of stochastic partial differential equations 
is the object of study: 

^(t,x) = ^Au{t,x) + ( 1 u{t,x)v{t,x)) 1 ' 2 W 1 {t,x) 

^(t,x) = ^Av(t,x) + (ju(t,x)v(t,x)) 1 / 2 W 2 (t,x), (2.1) 

where A = £\ is the Laplacian, 7 > 0, and Wi(t,x) (i — 1,2) are independent space- 
time white noises on R + x M. One can associate u(x,t) and v(x,t) with the male and 
female populations of "particles" (respectively) at spatial location x and time t. Loosely 
speaking, (12. ip says that individual male or female particles moves around according to 
Brownian motion, but branching is only possible when both male and female particles are 
present at the same spatial location. Notice that at spatial locations where the female 
population is 0, the branching rate for the male population is also 0, therefore the male 
population does not "die" and the only effect on the male population at those spatial 
locations is the diffusive effect of the heat kernel A. This behaviour is not very realistic, 
since one would expect a "natural" death rate of male particles even without the presence 
of any female particles. In this work, we study a model involving a finite number of male 
and female particles with more (somewhat) "realistic" behaviour. 

The model we study involves two types of particles, male and female, residing on 
the integer grid S = Z d or eL d . More specifically, each site x € S contains two nests, one 
for the male particle and the other for the female particle. Each nest can be inhabited by at 
most 1 particle, either male or female. Let E — {0, 1} and F — E x E be the set of possible 
states at each site in S. For x £ S, we write 

z(x) = (e(x),e(x)i 

where ^(x) denotes the number (0 or 1) of male particles at site x, and £, 2 (x) denotes the 
number of female particles at site x. We define the interaction neighbourhood 

TV = {0,2/i, . . .,Vn}, 
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and the neighbourhood of x 



N x = x + N. 

For example, Af = {0, —1, 1} if we have nearest-neighbour interaction on Z. Let Ci(x, m, £) 
denote the rate at which nest m (m = 1, 2) of site x flips to state i (i = 0, 1), and assume 
Cj(a;,m, £) depends only on the neighbourhood M x , i.e. 

c z (x, m, £) = (a), £(z + j/ x ), . . . , £(x + y N )) 

for some function hi >m ■ F N+1 — ► M + . The death rate Co is constant, 

/ ,5, iff"(z) = l 
co(x,m,Q = i . , (2.2) 

[ 0, otherwise 

while the birth rate Ci(x,m, £) is positive only if both male and female particles can be 
found in J\f x . For example, the diploid branching particle model we consider in Chapter 12. II 
a bit later has 

c ^ m ^~\ o, otherwise ' { 6) 

where 

n m -(x,0 = |{^ £ A4 : + z) = 1}|, 

i.e. at rate A, each pair of male and female particles in Af x give birth to a particle at nest m 
of site x if that nest is not already occupied. A more stringent condition, as in the particle 
model with rapid stirring we consider in Chapter 12.21 a bit later, is to require both parent 
particles to reside at the same site, i.e. 

M) rw.o. ( , 4) 



where 



H +2 (x,0 - |{z e A4 : ^(x + z) = 1 and £ 2 (x + z) = 1}|. 



This more stringent condition should not alter the behaviour of the particle system if one 
allows a larger A than in (|2.3p . but it does help to simplify the analysis somewhat. 



2.1 Diploid Branching Particle Model 

We first describe the model with birth and death rates as in (|2.2I) and (|2.3p . for which we 
will establish the existence of nontrivial stationary distribution(s) and consequently a phase 
transition later in Chapter [3l First, we restate the model in words: 

1. Birth. For each nest (x,m) and each pair (21,22) £ A4 x M x such that ^{zi) = 1 
and £ 2 ( z 2) = 1, where Z\ and zi need not be distinct, with rate A, a child of (21,22) 
is born into nest m of site x if (x, m) is not already occupied. 
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2. Death. Each particle dies at rate S. 



We can think of this particle system as a generalized spin system, generalized in 



Chapter 3 of [Liggett 1985] for a detailed introduction on classic spin systems. We observe 
that the "all-0" state (i.e. ^(x) = £, 2 (x) = for all x) is an absorbing state, therefore 
the probability measure that concentrates only on the "all-0" state is a trivial stationary 
distribution. We say a stationary distribution is nontrivial if it does not concentrate only 
on the "all-0" state. A major goal of this work is to establish the existence of nontrivial 
stationary distributions for various particle systems. 

This interacting particle system involving the birth and death mechanisms described 
above can be constructed using a countable number of Poisson processes [Durrett 1995]. 
Without any loss of generality, we assume A and 8 to be < 1, since we can just slow down 
time by max(A,<5) if either A > 1 or 6 > 1. Define 



We assume c* < oo. Let {T^- l ' m : n > 1} be the arrival times of independent rate c* Poisson 
processes, and {U^ % ' m : n > 1} be independent uniform random variables on [0, 1]. At time 
t = X^' l,m , nest (x,m) flips to state i if JJ^ l ' m < Ci(x, m, £ t _)/c*, and stays unchanged 
otherwise. 

Since the number of Poisson processes is infinite, there is no first flip and the exis- 
tence and uniqueness of the process from this construction is not completely trivial. One 
can, however, use Theorem 2.1 of [Durrett 1995] to find a small to such that the spatial 
grid S can be divided into an infinite number of components, each of which is finite and no 
two of which interact during time [0, to]. This allows construction of the process up to time 
to, and by iterating this procedure, we can construct the process for all t. One can also 
see easily that this construction is unique. Alternatively, one can explicitly write down the 
generators Q 1 and Q 2 associated with the particle system with death rates p.2[) and birth 
rates (|2.3p and (|2.4p respectively: 



the sense that the phase space at each site is {0, l} 2 rather than {0, 1}. One can refer to 




E 



S X ,n) - /(£)) 



(i,m)eSx{l,2} 



(2.5) 



E 



srwm - s x , m ) - /(0) 



(i,m)6Sx{l,2) 



+ E ^W(y)(i-r(z))(/(£+4, m ) -/(O) 



(2.6) 
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where / has compact support and # x ,m is a function on5x {1, 2} that is one at (x, to) and 
zero everywhere else, and apply Theorem B3 in [Liggett 1999] (Theorem 1.3.9 in [Liggett 
1985] only gives the Markov property) to see that Q is a Markov generator and therefore 
determines a unique ({0, 1} 2 ) Z Feller Markov process. 

An important consequence of the construction using Poisson processes described in 
the last paragraph is that the semigroup T t f(£o) = E^°f(^ t ) corresponding to the particle 
system is a Feller semigroup. As in Corollary 2.3 of [Durrett 1995], one can show that if 
Co ~~ > £o> then for t < to, E^° /(£") — > E^°f(^t) since S consists of components that are 
finite and do not interact with each other during [0, to]. One can then iterate this for as 
many times as one likes. Summarizing results from the three previous paragraphs, we have 
the following theorem: 

Theorem 2.1.1. There exists a unique Feller process £ t constructed as before with genera- 
tor (OOP or WM . 

One can represent this construction graphically, for which we give an example with 
S = Z and N = {-1,0,1} in figure O Let to € {1,2}, x,y,z € S, {R^ m ,n > 1} 
be independent Poisson processes with rate 5, and {T^ ,m ' y ' z , n > 1}, with y,z e J\f x , 
be independent Poisson processes with rate A. At space-time points ((x, to), R% m ), we 
draw a symbol 6 to indicate that the particle (if any) residing at (x, to) is killed at time 
R%' m . At space-time points ((x, to), T*' m ' y ' z ), we draw arrows from ((y, 1), T£' m ' y ' z ) and 
((z, 2), T£' m - V ' z ) to ((x, m),T£' m ' v ' z ) to indicate that a birth event will occur at nest (x, to) if 
(x, to) is not already occupied and nests (y, 1) and (z, 2) are both occupied at time T^ m ' v ' z . 

In figure 12. 1( the bottom line represent the state (occupied or empty) of nests at 
t = 0. We use thick lines to represent occupied (wet) nests, and thin lines to represent 
empty nests. Without any birth or death event, the state of a nest remains unchanged as 
t increases. At a death event, i.e. at points marked by <5, a thick line is changed to a thin 
line, while a thin line remains unchanged. And at a birth event, the state of nests at the 
origins of the two arrows pointing at (x, to) is checked - if they are both occupied, then a 
thin line at (x, to) is changed to a thick line, while a thick remains unchanged; otherwise, 
nothing happens. 

In Chapter [3l we will use this graphical construction to establish the existence of a 
nontrivial stationary distribution for the diploid branching particle model if A/5 is sufficiently 
large, and extinction if X/S is sufficiently small. 

The particle system £ with generator (12. 5|) or (|2.6|) is attractive in the sense that £ 
is monotonic in initial conditions. One can check that if £o(x) < £o(x) for all ieS, where 
(0,0) < (0,1) < (1,1) and (0,0) < (1,0) < (1,1) but (0,1) £ (1,0), then &(x) < f t (x) for 
all x and t. This is true since every birth or death event preserves the inequality <. For 
example, if £t_(x) = (0,0) and £t-( x ) — (0,1), and at time t there is a male birth event 
at site x, then £t(x) = (1,0) and £t( x ) — (1, 1), so the inequality £t(cc) < |t(x) has been 
maintained. Similarly, one can check that the particle system £ is increasing in the birth rate 
A and decreasing in the death rate 5, by coupling the random variables T*< hm and f/^' 1 '™ 
involved in the constructions in the obvious way. Because of this monotonicity, along with 
the existence of nontrivial stationary distributions for sufficiently large X/S and extinction 
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-2 -1 1 2 t = 



Figure 2.1: Graphical representation: the solid lines represent nests (a;, 1), while the dotted 
lines represent nests (x, 2). Thick lines indicate occupied (wet) nests, while thin lines indicate 
empty nests. 

for sufficiently small X/6 which we will establish a bit later in this work, we may conclude 
that there is a phase transition in the behaviour of the particle system £. 

2.2 Description of the Particle Model with Rapid Stir- 
ring 

If we add rapid stirring to the particle system, i.e. we scale space by e and "stir" neighbouring 
particles at rate e -2 in addition to performing the birth and death mechanisms, then the 
particle system converges to the solution of a reaction-diffusion PDE as e — > (see Theorems 
8.1 and 8.2 in [Durrett 1995] and the beginning of Chapter I2.3p . This PDE represents the 
"mean-field" behaviour of the particle system and is usually easier to analyze than the 
particle system itself. As promised earlier, we will establish later in Chapter [3] that there 
is a phase transition for the diploid branching particle model (i.e. without rapid stirring), 
but obtaining any reasonable estimates on exactly where this transition occurs seems to 
be difficult. One advantage of adding rapid stirring mechanisms is that one can get a 
pretty good idea where the phase transition occurs in the rapidly stirred particle model by 
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analyzing the limiting PDE, or simulating this PDE on a computer. 

Moreover, this convergence establishes a connection between the particle model and 
PDE systems, which is of independent interest. Since many PDE's arise out of natural 
systems, this connection justifies the study of the PDE. The underlying stochastic system 
can also yield information about the PDE; for example, in our case, as we will see in 
Chapter 12. 3i the monotonicity of the particle system will lead to the monotonicity of the 
PDE. Information about the PDE will similarly yield information about the particle model. 
In Chapter [51 we will establish condition (*) on the PDE (see page l3"Tj). which will tell us 
that there exist nontrivial stationary distributions for the particle system with sufficiently 
small e. 

For the particle models with rapid stirring, we work with S = eZ d , and denote the 
corresponding process by £ e . We also assume the birth and death rates in (|2.2[) and (12. 4[) 
(i.e. generator Q 2 in (|2.6| ) with 5=1, while the neighbourhood AT is nearest neighbour: 



Here we use the Z^-norm: ||y|| = J], .jj. In addition to the transitions in the diploid 
branching model, we introduce spatial movement of particles between neighbouring sites 
called "rapid stirring" . We consider two rapid stirring mechanisms in this work, one called 
"lily-pad" stirring, and the other called "individual" stirring: 

• Lily-pad Stirring. For each x,y € eZ d with \\x — y\\i = e, ^(x) = (^ e,1 (x),^ e ' 2 (a;)) and 

£ e (y) = {^ e ' l (v)i^ e ' 2 {y)) are exchanged at rate e~ 2 . 

• Individual Stirring. For each i € {1, 2} and x, y € eZ d with \\x — y\\i — e, ^^(x) and 

£ e ' l (y) are exchanged at rate e -2 . 

Just as in the particle model without rapid stirring described in 12. l\ one can con- 
struct the particle model with either lily-pad stirring or individual stirring using a countable 
number of Poisson processes. Or alternatively, one can write down the generator explicitly 
and again apply Theorem B3 in [Liggett 1999] to establish: 

Theorem 2.2.1. There exists a unique Feller process £t with generator Q L for the particle 
model with Lily-pad stirring or generator Q 1 for the particle model with individual stirring: 



N = {y- h\\ = or e}. 



G L f(0 






(2.7) 



x ,yeX d ,\\x — y\\i—e 






me{l,2},x,yeZ d ,\\x-y\\ i=e 



where 





£(x,m'), ifz = y 
£{y,m'), ifz = x 



and 




£(z,m'), if (z,m') ^ (x,m),(y,m) 
€(x,m), if (z,m') = (y,m) 
t{y,m), if (z,m') = (x,m) 
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For lily-pad stirring, instead of thinking of a site that consists of two nests as in the 
diploid branching model, we can view each site as having 4 states in 

F = {0,1} 2 = {(0,0),(0,1),(1,0),(1,1)}. 

We restate the dynamics of the particle model in terms of these four states. At any site 
x G eZ d , only the following transitions are possible: (0, 0) <-* (0, 1), (0, 1) (1, 1), (0, 0) <-> 
(1,0), and (1,0) <-> (1, 1), i.e. only one particle is born or dies at a particular time. The 
rates of these transitions are as follows: 

c ( o,o)M e ) = 1 if F(x) = (0,1) or e(:r) = (1,0), 

c (0 ,i)(z,e) = c (1)0 )(z, e) = 1 if ?{x) = (1,1), 

C(o,i)(z,e) = c ( i, )(^e) = An 1+2 (a ; ,e ) if f (x) = (0,0), 

c ( i,i)(x,e) = An 1+2 (x,e) if e(x) = (0, 1) or ?(x) = (1,0). 

In words, each particle, male or female, dies at rate 1. If site a; is occupied by both a male 
and a female particle, then with rate A, it gives birth to a male (respectively female) child- 
particle at a neighbouring site provided that the neighbouring site is not already occupied 
by a male (respectively female) particle. 

For individual stirring, we still view the particle system with nests (x, m) £ eZ d x 
{1, 2} and each nest assuming one of two states in E = {0, 1}. 

The difference between these two stirring mechanisms is that lily-pad stirring forces 
male and female particles at a site to move together, but individual stirring allows indepen- 
dent movement of male and female particles. Every exchange of particles, in both lily-pad 
stirring and individual stirring, is monotonicity preserving, thus neither stirring mechanism 
disrupts the monotonicity property of the particle system. 

2.3 Convergence to a PDE for Lily-pad Stirring 

Consider the particle system with lily-pad stirring with its generator given by (12. 7p . For 
i G F, define 

ut(t,x) = P(C t (x) = i) 

then Theorem 8.1 in [Durrett 1995] shows that if gi(x) is continuous and u\(0,x) = gi{x) 
for all i, then 

Ui(t, x) = lim vS(t, x) 

e— >0 



exists and satisfies the following system of PDE's: 

= A«(o,q) + M (o,i) + u (i,o) - 2Ad«( 0) o)U(i,i) 

= Au (0,l) + ~ u (0,l) + M u (o,o) - U (0,1)) U (M) 

= Au( ljQ ) + tt(i,i) - U(i :0 ) + Ad(u (0: o) - U(i,o))^(i,i) 
= Ait (1)1) - 2w (1) i) + Ad(u (0) i) +U(i )0 ))u(i,i)- (2.9) 



^(0,0) 

dt 
dt 

^(1,0) 

dt 



dt 
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Obviously, must lie in [0, 1] for all i and t since it is a limit of probabilities. We want to 
study the long time behaviour of (|2.9p . The system (|2.9|) is 3-dimensional if one takes into 
account the condition U(o,o) + u (o,i) + u (i,o) + u (i.i) — 1- We first do two transformations 
on the 3-dimensional parameter space (it(o,o)) M (o,i)7 u Ci,o)> u (i,i)) to obtain a monotone 2- 
dimensional system, which will be easier to analyze. First, define uq = U(o,o)i u i = u (o,i) + 
U(i.o), and u 2 = then (uq, itijMa) satisfies: 



du 

dt 
dui 

du 2 



Auq + ui — 2XcIuqU2 

Aui + 2u 2 — Ui + Xd(2uQ — Ui)u2 

Au 2 - 2u 2 + \duxU2- (2-10) 



The above system can be written as the limiting PDE under rapid stirring of another particle 
system £ c , still on S = eZ d , with state space F = {0, 1, 2}, and transitions <-*• 1 and 1 2 
at rates 

c (x,C e )-l if C e (^) = 1 

ci(x,C e )=2 ifC £ (x)=2 
Cl {x,C) =2\n 2 (x,C) ifC £ W=0 
c 2 (x,C e ) =An 2 (x,C e ) ifC e (z) = l, 

where 

n 2 (x,?) = \{zeAf:C(x + z) = 2}\. 

Under this model, monotonicity still holds: if Q(x) < Co( x ) f° r all x (here the ordering of 
F is the usual one: < 1 < 2), then Q(x) < Q(x) for all x and t, since every transition still 
preserves the inequality < . Let (u\ (0, x), u|(0, x)) = (gi(x), g 2 (x)) and (uf (0, x), u|(0, x)) — 
(<?i(x), g 2 (x)) be two sets of initial distributions such that gi + g 2 < gi + g 2 and g 2 < g 2 
everywhere. Then for all x, 

ul(0,x) < u 2 (0,x) 
u\(0,x) + u £ 2 (0,x) < u\(0,x) + w 2 (0,x), 

so it is possible to set up two initial conditions Co and Coj sucn that P(£q(x) — i) — w|(0, x) 
and P((q(x) — i) — u|(0, x), i — 1,2, and Q(x) < C6( x ) holds for all x and w. Since 
Ct ( x ) ^ Ct( x ) f° r a ^ ^ aim x i the monotonicity property of £ implies 

P(C t e (z) > 1) < P(g(aO > 1) and P(C t (x) > 2) < P(C t {x) > 2), 

i.e. for all t and x, 

x) < a;) 

x) + U 2 (<, x) < X) + Ul(t, X). 

Now we transform the parameter space a second time by defining (a, 0) = {u\ + u 2 , u 2 ) and 
writing c = Xd, then (at,/3t) is monotone in the initial condition since u\(t, x) — > Ui(t,x). 
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In particular, if (1 — a, a — (3,(3) and (1 — a, a — (3,(3) are both solutions to (|2 . 10[) with 
ao(x) < ao(x) and (3q(x) < (3o(x) for all x, then ctt(x) < ott(x) and (3t{x) < (3t(x) for all x 
and t. Straightforward calculation shows that (a, (3) satisfies the following system: 

Aa + (2c - 2ca + l)/3 - a 
A/3 + (c(a -p)- 2)(3. 

Since (iti,U2) S [0, l] 2 , (a, (3) lies in the triangular region 

K = {(u,v) :0 <u,v <l,u>v} (2.11) 

for all t > 0. We change variables from (a, (3) to (u, v) and summarize this paragraph in the 
following lemma: 

Lemma 2.3.1. The PDE 

Am + (2c(l -u) + l)v-u 

Av + (c(u - v) - 2)v (2.12) 

is monotone in initial conditions that lie in 1Z — {(u, v) : < u,v < l,u > v}, i.e. if 
there are two initial conditions (uq,vq) £ 1Z and (uo,vq) G 1Z, with uq < uo and vq < vq 
everywhere, then Ut < Ut ond Vt < Vt everywhere, for all t; furthermore, both (ut,Vt) and 
(ut, Vt) lie in 1Z for all t. 

In Chapter [5l we will analyze (|2.12[) to establish the following theorem: 

Theorem 2.3.2. If X/5 is sufficiently large and e is sufficiently small, then there exists 
a nontrivial translation invariant stationary distribution for the diploid branching particle 
model with lily-pad stirring with generator 

2.4 Convergence to a PDE for Individual Stirring 

Unlike lily-pad stirring, Theorem 8.1 in [Durrett 1995] cannot be directly applied to get 
convergence to a PDE system for individual stirring. We can, however, follow the ideas 
used in the proof of that theorem 8.1 to establish a corresponding result, Theorem 14.0.51 
We consider the particle model with individual stirring described in Chapter 12. 21 For i 6 E, 
define 

<m(M) = P{Ct{x,m) = i). 

Then Theorem 14.0.51 implies that if gi >m : M — > [0, 1] is continuous and itf TO (0, x) = gi. m (x), 
then 

Ui, m (t,x) = limul m (t,x) 



da 
~dt 

dt 



du 

di 
dv 

~ot 



n 



exists and satisfies the following system of PDE's: 




where we define 



c = \d 2 . 



Since uq,i + = "0,2 + ui,2 



1, it suffices to study the PDE for u\,\ and 111,2: 



<9i 



Au 1;1 — ui i + 2c(l — Mi 2 



#1*1,2 



Aui j2 - Ul,2 + 2c(l - Ul,2)wi,lWl,2- 



(2.13) 



<9t 



Notice that if we start with a symmetric initial condition, i.e. g^i = g i 2 , then the solution 
to (|2.13[) is also symmetric. And if we define u = un = U\p, then we obtain the following 



This PDE has been analyzed in [Durrett and Neuhauser 1994] as their sexual reproduction 
model (example 3 on page 291). In fact, it is not difficult to see that if u\\ = 1*1,2 then 
choosing the "father" from the male population is exactly the same as choosing the "father" 
from the the female population, hence it is quite natural for this reduction to occur. Theorem 
4 of [Durrett and Neuhauser 1994] states that if c > 2.25 then the sexual reproduction model 
of Durrett and Neuhauser has nontrivial stationary distribution(s). Although this theorem 
does not directly apply to our particle system £ e with 2 types of particles because of the 
difference in stirring mechanisms, one can nevertheless trace through the proof of Lemma 
3.3 of [Durrett and Neuhauser 1994] while making obvious changes, to establish a similar 
result: 

• Let < px < pa < 1 be the two nonzero roots of f(u). Define (3 = (po — pi)/10 and 
Ik — ILkei + [-L, L) d . If e is small, L is large, and £ e (0) has density at least p\ + j3 of 
both male particles and female particles in Jo, then for sufficiently large T, with high 
probability £ e (T) will have density of at least po — (3 in Ii and 

This result can then be fed into a comparison argument, comparing the particle system 
with oriented percolation, as on page 312 of [Durrett and Neuhauser 1994] or in the proof 
of Theorem 13.3.21 later on, to establish the existence of nontrivial stationary distribution(s) 
for the particle system £ e under individual stirring with sufficiently small e. Then we have 
the following theorem: 



PDE for u: 



du 
~dt 



Au + f(u), 

-u + 2c(l - u)u 2 . 



(2.14) 



/(«) 
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Theorem 2.4.1. If X/5 is sufficiently large and e is sufficiently small, then there exists 
a nontrivial translation invariant stationary distribution for the diploid branching particle 
model with individual stirring with generator \2.8\) . 

We will not explicitly write down the details of the proof, but instead refer the 
interested reader to [Durrett and Neuhauser 1994] for details. 
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Chapter 3 

Results on the Diploid 
Branching Particle Model 



In this chapter, we assume the model with generator (|2.5|) described in Chapter l2.1[ i.e. the 
particle system with birth and death mechanisms, but no stirring. We briefly restate the 
model to remind the reader: the rate at which nest to of site x flips to state i, Ci(x, to, £), is 

co(xmn = { 5, if £ m ( x ) = 1 Cl r x m £) = ( *ni(x,0n 2 (x,0, if£ m (x) = 
' ' 1 0, otherwise ' ' 1 0, otherwise 



where 



n ml {x^) = \{^M x :C n {x + z) = l\l 



and the M x contains the site x and its 2d nearest neighbours. The goal is to establish the 
existence of a phase transition. 



3.1 Existence of Stationary Distributions 

We first establish that stationary distributions exist. Define 

£ (x) = (1, 1) for all x. 

Let Ti/(£o) = E*°f[£t) be the semigroup corresponding to the particle system, then T t is a 
Feller semigroup by Theorem l2.1.1l We begin with a lemma. 

Lemma 3.1.1. For any i,BcS = Z d , the function 

1 1 ► P (f t (x) = Vx e A, f t (y) = 0VyeB) (3.1) 

is increasing. 

—1 —2 —1 

Proof. Let ao = £ s and (3$ — ^ s for an arbitrary fixed s. Then £ (x) > cto(x) and 

2 

Co( a; ) — @o (%)■ Let (at, fit) be the state at time t of the particle system that started 
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with initial condition (ao,/?o)- Then by the fact that the particle system is monotone in 
initial conditions, we have 

— 1 —2 

Ct ( x ) ^ ot t {x) and £ t (x) > j3 t {x) 
for all t and x. Thus by the Markov property of C, 

P (f t (x) = Vx e A, cl(y) = Vy g b) 
< P (a t (a?) =0Viei, /3 t (y) = Vy G B) 
= P (£ +t (x) = Vx G A, e' +t (y) = Vy G b) . 

This implies that the function in (|3.1[) is increasing in t. □ 

Theorem 3.1.2. As t — > oo, £ t =>■ Coo- TTie Zimii is a stationary distribution that stochas- 
tically dominates all other stationary distributions and called the upper invariant measure. 

Proof. For arbitrary subsets A, B , C = {x\, . . . , x m }, and D = {yi, . . . , y„} of S, we write 
P (t t (z) = Vz G A, f t (w) = Vw e B, g(as) = 1 Vz G C, Ct (y) = 1 Vy G £>) 

Cm+n \ 
i 7 ' 

where 

E i = ift( z ) = Vz G A U {a;,}, ^(to) = Vw G B} if i = 1,. ..,m 

and 

Bi = {|t (z) = Vz G A, ct(™) = 0VmeBU {y 4 -m}} if i = m + 1, . . . , m + n. 
We can use the inclusion-exclusion formula on B(U™~ 1 n Bi), i.e. 

(m+n \ m+n 
=E - E n Ej) + . . . + (-i) m +"+ 1 p(£; l n . . . n e 3 ). 
i=l / i=l i<j 

Every term in the above expansion is in the form of 

P(f t (z) = V* G •, ?t H = Vu> G ••), 

which is increasing in t by Lemma 13.1.11 Therefore 

P(f t (z) = Vz G A ft H = Vra e B, £(*) = 1 Vx G C, cl(y) = 1 Vy G £>) 

converges for all A, B, C, and D, i.e. all finite dimensional distributions converge. Thus 
a weak limit (Coo > Coo) ex ists and it follows from a standard result that since T t is a Feller 
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\ _2 _1 _2 

semigroup, (£00,^00) is a stationary distribution. We can also easily see that (£00, £00) dom- 
inates all other stationary distributions: let (£o>£o) be another stationary distribution and 
(£\£ 2 ) be the process with initial condition (£o,£ 2 ), then (£t>£t) nas the same distribution 
as (^O)Co) f° r all t > and (C 1 ,^ 2 ) dominates (^,C 2 ) because of monotonicity, therefore 
(So . So ) dominates (I* , | 2 ) . □ 
Theorem III. 2. 3 in [Liggett 1985] establishes the previous theorem for spin systems 
with state space {0, 1} S , therefore it does not directly apply to our case. One can, however, 
easily adapt the proof of that theorem to this case, and obtain a slightly different proof. 

3.2 Extinction for Sufficiently Small X/5 

Theorem 3.2.1. If \\J\f\ 2 < S, then the particle system £ has no nontrivial stationary 
distribution. 

Proof. We compare a modification of the particle system £ with the contact process. We 
recall that the contact process £ on Z d has two states and 1 at every site x, and has the 
following dynamics: 

co(x,Q = ( 5 ' ii ^ = 1 , Cl (x,0 = ( an ( x >®> ifC(*)=0 
\ 0, otherwise ' ' \ 0, otherwise 

where n(x, C) = \{z G M x : ((z) = 1}|. Theorem 2.6 of [Durrett 1995] states that i£a\Af\ < 5, 
then the contact process has no nontrivial stationary distribution. 

We modify the mechanism of the particle model £ as follows: for the males, when 
(x, 1) G Sx {1} seeks out a pair of parents, say (xi, 1) and (X2, 2), it is no longer required that 
£ 2 (^2) = 1, but ^(xi) must still be 1. Correspondingly, when a female nest (a;, 2) G S x {2} 
seeks out a pair of parents (xi,l) and (x 2 ,2), it is only required that £ 2 (x 2 ) = 1. We 
denote this modified process £ = (£ 1 ,^ 2 ). The result of the modification is that £ x and £ 2 
are now decoupled, and behaves exactly the same as the contact process with birth rate 
a = X\Af\. Furthermore, by Theorem III. 1.5 in [Liggett 1985], the modified process (C 1 ,^ 2 ) 
stochastically dominates the original process (C 1 ,^ 2 )- If o:\J\f\ < 5, then £ has no nontrivial 
stationary distribution and (£j,£ 2 ) converges weakly to the all-0 state as t — > 00 for any 
initial condition. Thus (£t,<S; 2 ) & l so converges to the all-0 state for any initial condition if 
a\J\f\ = A|7V| 2 < S, as required. □ 

The proof above also shows that if J2 X £o( x ) + £o( x ) ^ s nn hc, then the population 
dies out in finite time a.s. if A|A/"| 2 < S, since the contact process or £ 2 has this property. 

3.3 Survival for Sufficiently Large X/5 

We use the idea of Chapter 4 of [Durrett 1995], i.e. we compare the particle system to an 
oriented percolation process. First, we define the oriented percolation process W. Let 

Cq = {(x, n) G Z 2 : x + n is even, n > 0} 



16 



and make Co into a graph by drawing oriented edges from (x,n) to [x + 1, n+ 1) and from 
(x, n) to (x — 1, n+1). Site (a:, n) is said to be a parent of sites (x + 1, Ji+1) and (x— 1, n+1). 
Notice that any site (x, n) with n^O has two parents. We think of n as the time variable. 
Given random variables lu(x, n) that indicate whether site (x, n) is open (1) or closed (0), we 
say that (y, I) can be reached from (x, to) if there is sequence of points x = z m , . . . ,zi = y 
such that \zk — 2ft— 1| = 1 for m < k < I and u>(zk, k) = 1 for to < £; < Z; in this case, we 
write 

(x,to) -> (y,Z). 

We say that w(x, n) (n > 1) is 11 M -dependent with density at least 1 — 7" if whenever (xj, n^), 
1 < i < I, is a sequence with || (xi, nf) — (xj, Tij)||oo > f° r * 7^ i; we have 

P(Lj(xi,m) = for 1 < t < J) < 7 J . (3.2) 

Given an initial condition Wo C 2Z = {x : (x, 0) S £0}, the process 

Wn = {y : (x, 0) — > (y, n) for some x G Wo}. 

gives all sites that can be reached from a site in Wo at time n. We say sites in W„ are wet. 
Theorem 4.2 of [Durrett 1995] states: 

Theorem 3.3.1. Let W% be an M-dependent oriented percolation with density at least 
1 — 7 starting from the initial configuration Wq in which the events {x £ Wq }, x £ 2Z, are 
independent and have probability p. If p > and 7 < 6" 4 ( 2M+1 ) , then 

liminf P(0 € W 2 P „) > 19/20. (3.3) 



This theorem shows that if the density of open sites 1 — 7 is sufficiently close to 1 
and we start with a Bernoulli initial condition for Wo, then the probability that is wet at 
time t does not go to as t — > 00. Notice that the right hand side of the estimate (|3.3|) is 
a constant that does not depend on p. We will construct an oriented percolation process 
that is stochastically dominated by the particle system £, such that existence of nontrivial 
stationary distribution for the oriented percolation process implies existence of nontrivial 
stationary distribution for the particle system. 

Theorem 3.3.2. If X/8 is sufficiently large, then the particle system £ with generator \2. 5\) 
has a nontrivial stationary distribution. 

Proof. We follow the method of proof as in Chapter 4 of [Durrett 1995] . Since scaling time by 
an factor of 1/5 does not change the behaviour with respect to stationary distributions, we 
may assume without any loss of generality that 6 = 1. We will select an event G^ measurable 
with respect to the graphical representation in [— 1, 1] x Z d_1 x [0,T), i.e. measurable with 
respect to the filtration generated by all the Poisson arrivals {R^'" 1 } and {T^ m,y ' z } used to 
construct the particle model in Chapter 12. ll that arrive at any sites in [— 1, 1] x Z d_1 during 
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the time interval [0, t). For any 7 > no matter how small, there is A and T and an event 
G| with 

P(G & )>l- 7> 

so that on G €o) if &(0, 0, . . . , 0) = (1,1), then fr(l, 0, . . . , 0) = f T (-l, 0, . . . , 0) = (1,1). 
One can achieve this by choosing T so small that the probability of any death occurring at 
any nests of sites (—1,0,..., 0), (0, 0, ... , 0), and (1,0,..., 0) is less than 7/2; then one can 
choose A large enough so that the probability of having birth events from ((0,0, . . . ,0), 1) 
and ((0, 0, . . . , 0), 2) to each of the four nests at sites (— 1, 0, . . . , 0) and (1,0,..., 0) during 
[0, T) is larger than 1 — Z, In other words, if we define the event 

G^ = {There are no death event during [0, T) at sites (—1,0,..., 0), (0, 0, . . . , 0), 
or (1, 0, . . . , 0); and there are birth events from ((0, 0, . . . , 0), 1) 
and ((0, 0, . . . , 0), 2) to each of the four nests at sites (— 1, 0, . . . , 0) 
and (1,0,..., 0) during [0,T)}, 

then G£ satisfies the requirement and P(G^ ) > 1 — 7 for some A and T. G^ is the "good 
event" that will ensure male and female particles get born at sites x — 1 and x + 1 provided 
site x is inhabited by both a male and a female particle. See figure EU] for an illustration of 
this event. 







1 


1 
1 
1 
1 






1 
1 
1 























-1 1 

Figure 3.1: Graphical representation of the event G^ : at least 4 birth events and no death 
events. 

We start with a configuration with events {£o( x ) = 1} an d {£o( x ) = 1} au indepen- 
dent and having probability pi and P2 , respectively. Let 

X n = {x : (x, n) e £0, £„t(x, 0, . . . , 0) = (1, 1)}. 
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Before defining the oriented percolation process W n , we first define V n that will turn out to 
be slightly larger than W n but nevertheless dominated by X n . 

We define V n inductively. First, we set Vq = Xo and leave cj(-.O) undefined. Now 
assume that Vq, Vi, . . ., V n , and uj(x, I) with I < n— 1 have been defined such that Vb C Xo, 
. . ., V n C X„. If a; € V n then we set 

1, if G(T_ xei (£ nT ) occurs in the graphical representation 

(G cr _ xei (j nT ) is G^ translated by — xe± in space and ,^ ^ 

-nT in time) 
0, otherwise 

which ensures that lo{x, n) = 1 with probability more than 1 — 7 if a; € V n . For completeness, 
if x £ V n , then we set w(x,n) equal to an independent random variable that is 1 with 
probability 1 — 7 and with probability 7. Now we define V n +i to consist of all sites 
(x, ri + 1) with cither 

x — 1 G Ki arid cj(x — 1, 71) = 1 (3-5) 

or 

x+leVn and u(x + l,n) = 1. (3.6) 

For (x', n) £ F„ C X„, the definition ofX n means that £, n T(x' , 0, . . . , 0) = (1,1); if oj(x', n) = 
1, then the "good event" occurs in the space-time rectangle 

[x' - l,x' + 1] x Z^ 1 x [nT, (n + 1)T), 

which, since Cmt(x', 0,...,0) = (1,1), implies that 

Z(n+i) T {x' - 1, 0, . . . , 0) = (1, 1) and i {n +i)T{x + 1, 0, . . . , 0) = (1, 1). 

Therefore the conditions (|3.5p and (|3.6p ensure that any (x,n + 1) G is a member of 

X n+ \, hence V n +\ C X n+ \. By induction, for all n G Z + , we have 

V n C X„. 



cj(x, n) = 



The cj(x, 7i ) thus defined is a 2-dependent oriented percolation on Co with density 
at least 1 — 7; notice that {w(0,0) = 0} and {w(2,0) = 0} are dependent events since 
both require that no death occur during [0, T) at nest ((1, 0, . . . , 0), 1) or ((1, 0, . . . , 0), 2), 
but {w(0,0) = 0} and {a>(4, 0) = 0} are clearly independent. Also, {a>(xi,rii) = 0} and 
{<x>(x2,7i2) = 0} are independent for any x\ and xi provided n\ + 1 712- Thus if (xj,n,-), 1 < 
i < /, is a sequence with || (x^, 7ij) — (xj, ) 1 1 00 > 2 for i 7^ j, then the events {w(xi, rii) = 0} 
are all independent, which implies 

P{u}(xi,m) = for 1 < t < J) < 7 J , 

as required by (|3.2[) . Notice that even though V n +\ clearly depends on V n , lo{x\, 71+ 1) and 
uj(x2, n ) as defined by (|3.4|) arc indeed independent since they relate to independent Poisson 
arrivals in disjoint space-time rectangles. 
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• G occurs 

O G does not 
occur 



(-4,0) (-2,0) (0,0) (2,0) (4,0) 

Figure 3.2: Illustration of V n and W n : all sites in W n are connected to (2, 0) via a sequence 
of solid lines, while sites in V n \W n are connected to some site in W n via a dotted line. The 
shaded rectangle indicates the space-time region that affects whether G^ occurs 

Now we define 

W n = {(x, n) : (x, n) S V n and lo(x, n) = 1}, 

then W n C V n and W n is a 2-dependent oriented percolation with density at least 1 — 7. If 
7 is sufficiently small, then by Theorem 13.3. II 

liminfP(0 € W^) > 19/20. 

Since the particle system £ dominates V n , which in turn dominates W n > we have 

limmfP(6nr(0,0,...,0) = (1,1)) > 19/20. (3.7) 

n — >oo 

Thus if we start with initial configuration £q(x) = (1, 1) for all x (in this case, p = 1 for 
Theorem I3XH . then by Theorem l3~L2l | t ^ Joo- And (EB implies that 

^(looO) = (1j !)) > 19 /20 for any x G Z rf , 
i.e. the upper invariant measure ^ is nontrivial. □ 

Remark 3.3.3. Using the same idea of comparing the particle system £ to an oriented 
percolation process, one can show that if the initial condition is finite, e.g. £o(0, 0, . . . ,0) = 
(I, f) and everywhere else, then for sufficiently large X, 

limmfP(£ nT (x,0,...,0) = (1,1) for some x) > 19/20 
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for some T. Here we use the following fact (see Theorem 1^.1 of [Durrett 1995]) about 
C a = {(y,n): (0,0) ^ (y,n)}: 

Ifl< 6- 4 ( 2M+1 ) 2 7 then P{\C Q \ < oo) < 1/20. (3.8) 

Remark 3.3.4. In fact, the proof of Theorem \ 3.3.1\ also establishes the following: if the 
initial population has a positive density of male and female particles everywhere and X/S is 
sufficiently large, then at sufficiently large times, the density of (1,1) -sites (i.e. sites with 
both a male and female particle) is larger than 9/10. Similarly, using \3.8\) . the following is 
also true: if the initial population is nonzero and X/S is sufficiently large, then at sufficiently 
large times, the probability of survival ( i. e. existence of a site with both a male and a female 
particle) is at least 9/10. 
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Chapter 4 

Convergence Theorem for 
Individual Stirring 

In this chapter, we establish the convergence result for the individual stirring model as 
promised in Chapter 12.41 We work in a slightly more general setting and consider random 
processes 

e t :eZ d x {1,2,...,M}-{0,1,...,k-1}. 

We call each x £ eL d a site, and each (x, m) £ eZ d x {1,2,..., M} a nest. There are M 
nests at each site. We think of the set of spatial locations 1 d x {1, 2, . . . , M} as consisting 
of M "floors" of Z d . Let 

N = {0,ej/i, . . . ,ey N } 

be the interaction neighbourhood of site and rj^ = max^gjv/ ||a;|| i be its radius. The process 
£ t e evolves as follows: 

1. Birth and Death. The state of nest (x, m) flips to i, i = 0, . . . , k — 1, at rate 

Ci(x,m,g) = hi, m (£(x,m),£(x + ez\,mx), ...,£(x + ez L ,m L )), 
where L is a positive integer, z±, . . . , zl £ Af, m\, . . . , mj, £ {1, 2, . . . , M}, and 

/ 1 , !m :{0,l,.., K -l} I+1 ^J{Cl + 
with hi^ m {i, . . .) = and K compact. 

2. Rapid Stirring. For each m £ {1, 2, ...,M} and x,y £ eL d with ||a; — y||i = e, 
^ £ (cc,to) and £ e (y,m) are exchanged at rate e~ 2 . 

This individual stirring model differs from the Lily-pad stirring model described in 
Chapter l2.2l in that the stirring action between corresponding nests at neighbouring sites are 
now independent. More specifically, exchanges are allowed between neighbouring nests on 
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the same floor only, i.e. between (0, 1) and (e, 1) but not between (0, 1) and (e, 2). We will 
show, in the theorem below, that this individual rapid stirring action between corresponding 
nests "decouples" (in the limit e — > 0) the dependence between all nests, at neighbouring 
sites and even at the same site. 

As an example, for d = 1, in the particle model with individual stirring with gener- 
ator (HH), we have k = 2, M = 2, L = 4, M = {0, -e, e}, 



c (x,m,£) 

and 

ci(x,m,£) = 



<5, if £(2;, m) = 1 
0, otherwise 



\{i{x - e, l)i{x - e, 2) + £(z + e, 2)£(x + e, 1)), if £(x, m) = 
0, otherwise 

In particular, we should define 

(zi,toi) = (-1, 1), (z 2 ,m 2 ) = (-1,2), (z 3 ,m 3 ) = (1,1), (z 4 , m 4 ) = (1, 2), 
and hi — h i rn as 

ho(a , a-L, a 2 , a 3 , a 4) = 5a , 

and 

hi(ao, a\,a 2 , 03, 04) = A(aia2 + a3Q!4)(l — ao), 
where ao = £,(x, m), a\ — £(x + ezi, mi) = £(x — e, 1), etc. 

Theorem 4.0.5. Suppose {£,q(x, m), (x, m) £ eZ d x {1, 2, . . . , M}} are independent and 
let uj m (t,x) = P(£t(x,m) = i). If u\ m (0,x) = gi, m (x) = gi(x,m) and g t : R d x 
{1, 2, . . . , M} — > [0, 1] is continuous, then for any smooth function <f> with compact support, 
as e —t §, 

ed ^y^lHliy^i)^} ->• / <t>(y)ui,m(t,y) dy, (4.1) 

y£eZ d 

where Ui tTn (t,x) is the bounded solution of 
dui.n 



= Aui m + fi tm (u), u im (0,x) =gi(x,m), 
at 



fi, m (u) = (ch(0, m, £)1(£(0, m) ? i)) u - ( Cj (0, m, f)l(f (0, m) = i)) u , 

and (4>{£,))u denotes the expected value of 0(£) under the product measure in which state j 
at nest m has density Uj^ m , i.e. £(x,m), with x £ eL d and 1 < m < M, are independent 
with P(£(x, m) = j) = Uj, m . 
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Proof. We follow the program used in the proof of Theorem 8.1 in [Durrett 1995]. We first 
define a dual process for the particle system in part (a), then in part (b) we show that 
the dual process is almost a branching random walk. We will not explicitly write down 
parts (c) and (d) of the proof, which almost exactly resemble parts (c) and (d) of the proof 
of Theorem 8.1 in [Durrett 1995]. But to summarize these two parts, part (c) establishes 
that the dual process converges to a branching Brownian motion as e — > and defines a 
candidate limit Ui >m (t, x) of u\ m (t, x), and part (d) shows that this candidate limit satisfies 
the PDE in the statement of the theorem. Finally, part (e), which we write out explicitly, 
shows that convergence described in (|4.ip does occur, in addition to the convergence of the 
mean u\ m (t,x) to u i<m (t,x). 

a. Defining the dual process. The dual process associated with nest (x, in) and 
a fixed time t is a random process if' m, *(s), s S [0,t], where 

if ■*•*(«) G |J {(x,m) :xe eZ d ,me {1, 2, . . . , M}}* (4.2) 

consists of a finite number of particles residing at nests (x, m) E eL d x {1,2,..., M}. The 
events that influence the behaviour of the dual process Jf ' m >* at time s are those Poisson 
arrivals in the original process £ e that occur at time t — s. We start with 

if^'(0) = {(x,m)} 

and evolve if ,m,t (s) until s = t, which corresponds to rolling back the clock in the original 
process £ e from time t back to time 0. The process if ' m,t is constructed such that: in order 
to know the value of £ e (x,rh) at time t, it suffices to do a computation using values of £ e at 
time and nests (x,m) S if ■*'*(*). We call if' A >*(s) the influence set of if' A '*(Q). 

In order to define the dual process, we need to give a graphical construction for the 
particle system £ e similar to the one given in Chapter 12.11 We define a family of uniform 
[0, 1] random variables {U^ m ' 1 : n > 1} and two families of independent Poisson processes 
that respectively correspond to the birth/death flips and the rapid stirring mechanism: 

^ T x,m,i . n > 1} at rate c * = sup ^ m c .( a , 5 m; 

and {S^ y ' m : n > 1} at rate e~ 2 , 

with m £ {1, 2, ... , M} and \\x — y\\i = e. Notice that c* < oo since max^ m ||/ii jTO ||oo < oo. 
At T*'" 1 ' 1 , we check all nests (x,m), (x + ezi,mi), ...,(x + ezL,m,L) and use C/f ,m ' 1 to 
decide whether nest (x, m) should flip to state i based on the function hi iTn . And at S^' y ' m , 
we exchange values of £ e (x,m) and £ e (y,m), the corresponding nests at two neighbouring 
sites. The evolution of the dual process /f' m >*(s) depends on the Poisson arrival times 
{ T x,m,i :n >ij and {S£ y ' m : n > 1} in the following way: 

1. If (x, m) € if ™'*(s-) and T%< m,i =t-s, then we set 

7*,A,t( s ) = Jf.*.*( fl _) u {(x + ezi, mi), . . . , (x + ez L , m L )}. (4.3) 

Therefore each particle (x, m) £ if ,m, *(s) gives birth to L new particles at rate c*. 
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Figure 4.1: Illustration of the dual process: there are two birth/death events here, each 
giving birth to four additional particles, two on each floor; the first birth event is from a 
male (to = 0) nest, while the second birth event is from a female (m = 1) nest. 



2. If (x, to) e Jf>™>*(s-) and either S%' y > m = t - s or S^ x > m =t-s, then we set 
If^(s) = (/^(s-A^to)}) U {(y,m)}. 
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Thus two dual processes /f' m,t (s) and If ' m >*(s) evolve independently from each other 
except at time 

s= someT^" 1 ' 1 
where (x, m) G Jf ' m >*(s) n If ' m '*(s) or at time 

s= someS*> v < m 

where (x,m) G If> m,t (s) and (y,m) G If ,m '*(s), or (y,m) € If ,m,t (s) and (a;,m) G 

An equivalent way of describing the dual process is to define the pair 
(Xf ™(s),/Cf ™(s)) for < s < i, where Xf' ™(s) is the ordered set 

Xf'*(s) = (Xf*' (s), . . .,JCf ,A,x:f '* (0) ~ 1 (8)) 
and each Xf ,mj (s) G eZ d x {1,2,..., M}. For s = 0, define the number of particles 

fcf ,lh (0) = n 
for some n G Z + and the location of particles 

Xf™(0) = (x j ,m j ) 

for < j < n, such that 

/Cf'*(0)-1 

jf'*'*(Q) = (J {.V; '" ';())[. (4.4) 

i=o 

where >*•*(()) is defined in (fl^]) . Typically, iCf-*(0) = 1 and Xf>™>°(0) = {(x,m)}, but 
we use the more general initial condition since we will define Xf ,m,J inductively. Each 
particle in Xf' m ' J , < j < ICf ,m , jumps to a neighbouring nest on the same "floor" of 
eZ d x {1, 2, . . . , M} as dictated by the stirring mechanism (i.e. 5*-arrivals), until a T-arrival 
of type T x < m <i with < j < /Cf' A (s-) at some (x,m) G Xf' A (s-). At time s of this 
T-arrival, we set 

££•*(«) =i + /Cf™(s-) 

and 

X e (s) = (x + ez k ,m k ) 

for < fc < L, while leaving all existing Xf ,mj 's unchanged. Observe that 

7«.A.t( a ) = (J {Xf' A ' J '(s)}, 
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therefore the relation (|4.4[) is maintained for all s £ [0, t]. Afterwards, the ,m (s) particles 
jump according to the ^-arrivals until the next T-arrival at some (x,m) S {Xf :mj (s'),0 < 



A new particle may be born at a nest where a particle already resides; if this happens, 
we say that a collision occurs, and call the new particle fictitious. We prescribe this fictitious 
particle to give birth to L particles at rate c* and jump to a neighbouring nest on the same 
"floor" at rate e -2 , independently from all other particles, fictitious or not; furthermore, 
all offsprings of a fictitious particle are defined to be fictitious as well. If the number of 
collisions is 0, then there are no fictitious particles. As will be seen in the next paragraph , 
this is in fact the case in the limit e — > 0. Note that the stirring mechanism does not cause 
particles at different nests to "collide", i.e. end up at the same nest, because only exchanges 
between nests occur under the stirring mechanism. 

b. Characterizing the dual process. Having finished defining the dual process, 
we proceed to show that with high probability, the dual process can be coupled to a branching 
random walk and the movement of one dual process is independent from the movement of 
another dual. This part of the proof is quite similar to part (b) of the proof of Theorem 8.1 
of [Durrctt 1995]. 

First, we couple X^'" 1 ^ to independent random walks Y™' m, i that start at the same 
location at the time of birth of Xf ,mj and jump to a randomly chosen neighbour at rate 
2de~ 2 . We define the distance between two particles on two different "floors" to only depend 
on the site location: 



and say Xf' m 'i is crowded if for some k ^= j, WX^'" 1 ^ — Af' m ' fe ||i < r^. Recall that 
is the radius of the interaction neighbourhood N . When Xf ' m J is not crowded, we define 
the displacements of Y*- m -i to be equal to those of Af' mj . But when Xf' mj is crowded, 
we use independent Poisson processes to determine the jumps of Y* ,m 'i , To estimate the 
difference between Xf ,mj and Y*' m ' J , we need to estimate the amount of time Xf' mj is 
crowded. If j ^ k, then V* = Xf- m ^(s) — Xf' m ' fc (s) is stochastically larger than W s £ , a 
random walk that jumps to a randomly chosen neighbour in eL d x {1} at rate 4de~ 2 (see 
page 175 of [Durrett 1995] for details on how to couple these two processes so that one is 
stochastically larger than the other). Strictly speaking, V* is only defined after the j th and 
k th particles of X*' m are created, but as we shall see below, we are only interested in an 
upper bound on the occupation time of V* and in a ball, so the fact that these particles 
may only start to exist at some positive time only helps matters. Thus for any integer M > 1, 
W( Me = |{ s < t : \\V s e \\i < Me}\ is stochastically smaller than wf Ic = \{s < t : || W s £ ||i < Me}\. 
Well known asymptotic results [Durrett 1995] for random walks imply that if teT 2 > 2 then 



In the estimates above, d = 1 is the worst case, so we use Ewf ' u < CMet 1 / 2 if ie 2 > 2 
from now on. 



j < K\ < A (s'-)}. 



||(xi,mi) - (x 2 ,m2)||i = - x 2 ||i, 




(4.5) 
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The amount of time X^ m ^ is crowded in [0,t], denoted Xe(*)> can t» e estimated as 

follows: 

oo 

Ehm] = E E[ X i(t)\IC^(t) = K]P{Kr\t) = K) 

K=0 

oo 

< E KE[w^]P{lCf™{t) = K), 

K=0 

where we pick M large enough so that Me > r^, e.g. M = ~ max 0<i <Ar In what 

follows, C ... is a constant whose value may change from line to line. Then 

E[xi(t)} < E[w^]E[JC^{t)] = e c ' Lt E[wf u ] < C M e c * Lt ei 1/2 

if teT 2 > 2. To see the middle equality above, we observe that the branching mechanism 
of the dual process described in (|4.3p occurs at rate c* for both fictitious and nonfictitious 
particles, and every time a branching event occurs, one particle is replaced by L particles; 
therefore the mean number of branches at time t is e c Lt . It follows that 

E[ X i(t)} < C M e c ' U tt X ' 2 + 2e 2 < C M e c ' Lt e{l + t 1 ' 2 ). (4.6) 

This means that the expected number of births from Xf ,rnj while there is some other X*' m ' k 
in TV + Xg'^j is smaller than 

C LM e c ' Lt e(l + t^ 2 ). (4.7) 

Thus 

P(at least one collision during [0, £]) 

< P(at least one collision during [0, t]|£f*(t) < e-°- 5 )P(/Cf -*(t) < e" 5 ) 

+P(/Cf' A (t) > e" - 5 ) 

< P[# of collisions during [0, i]|/Cf •*(*) < e -°- 5 ]P(/C^™(t) < e" ' 5 ) 

+e a5 P[/Cf" i (i)] (4.8) 

< C L , M e c * Lt e a5 (l+t 1 / 2 ) + e - 5 e c * Lt 

< C L , M e c ' Lt e°- 5 (l+t^ 2 ), 

which — > as e — > 0. We use (|4.7|) and the condition K^' m {t) < e~ - 5 to bound the first 
expectation in flUgJ) , and we bound P(/CJ' A (t) < e -0 - 5 ) in [[ITg]) above by 1. 

This shows that the probability of at least one collision within a single dual during 
time [0, t] tends to as c — > 0. Furthermore, the same argument shows that the probability 
of at least one collision between two different duals for nests (x,rh) ^ (x',rh') during time 
[0, t] also tends to as e — > 0. For that, we observe that the estimate (|4.5p is independent of 
the initial condition W^; so in particular, this estimate still holds even if ||Wq||i = 0, which 
is the case when for example one considers two duals for two nests (x,rh) and (x, m!) at the 
same site x. Hence two different duals are asymptotically independent in the limit e — > 0. 
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The estimate (j4.6[) also leads to the following estimate on the difference between 

X x,ra,j and yi.mj ( see page yjQ ; n [ Durrett 1995 ] f or details): 

P ( max \\Xf' ™' j (s) - if ■* J '(a)| 00 > 2e 03 ] < Ce 0A (l + t^ 2 )e c * Lt . (4.9) 

This shows that with high probability, the movements of all the particles in a dual can be 
coupled to independent random walks, in addition to being independent from the movement 
of any other dual. 

c. and d. Defining a candidate limit and showing the limit satisfies the 

PDE. We will not write down the details of these two parts of the argument, since they 
are almost exactly the same as parts (c) and (d) of the proof of Theorem 8.1 in [Durrett 
1995]. From estimate (|4.9p . it is not too difficult to see that the dual process converges to 
the branching Brownian motion as e — ► 0. We can then define the candidate limit Ui tTn (t, x) 
(of u\ m (t, x) = P(^(x, m) = i)) using the limiting branching Brownian motion as the dual 
process. Part (d) then establishes that the candidate limit u^ m [t, x) satisfies the integral 
from of the PDE in the statement of the theorem. 

e. The particle systems converge. So far we have established that 

w i,m(^' * u i,m{tj x), 

i.e. the expected value converges. It remains to establish (|4.ip . For this, we define for a 
bounded function <fi with compact support supp(</>) of diameter D, 

(&,<!>)= e d ^(f) 1 {«(w."»)=i}- 

Then 



E[{$,<f>)] = e d J2 mPl&fam) = l ) = e d J2 0(»Km(*>iO 

y£eZ d y€eZ d 

(f>{y)ui tm (t,y) dy 



(4.10) 



by bounded convergence. Now we compute the variance of (£f , i 
Var[(£,0)] 



= E 



= E 



,y£el. d 



e 2d ]T </>(y) 2 (l my , m)=i} -P(%(y,m)=i))* 

yet1 d 



E 



e 2d y MM*) 

y,z£eZ d ,y=£z 



< D 



2d l 



x (l{£f(3/,m)=i} - P{&{y,m) = i)){l{ it ( z , m )=i} - P(Ct(z,m) = i))] 

llg Sup Cov[l {5 |(y !m)=i} , l{4|(«,m)=i}] 

y ,z£eZ d nsupp(4>),y^z 



Ml 



yGeZ d nsupp(0) 
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We observe that l{£ t € (y,m)=i} is a random variable taking values in {0, 1} and therefore has 
variance < 1/4. Also, part (b) of the proof shows that 

COV e = Sup Cov[l{£*(y, m ) =i }, l{£!(a, m )=i}] 
y t z&e% d ,y^z 

as e — > 0. The argument leading to the asymptotic independence of two duals in part 
(b) works for any two nests, so Cov[l{£«(j, iiri ) =i }, l{^( z .m)=i}} goes to zero uniformly for all 
y 7^ z. Now we have the following estimate on Var[(^, <j>)\: 



Var [(£,</>)] <D 



2d M J.II2 



,d|| J.II2 



which — > as e — * 0. Thus by (|4.10|) and Chebyshev's inequality, wc have 



P 



(£l>0) - / Hy) u i,m( t ^y) d v 



< p 



> s 



yGeZ d 



(d ^K,m(*'!')" / <f>(y) u i,m(t,v) dy 



y£el 



> 5 



< P 



(£,0}-e d ^ 0(»K m (t,y) 



(5 

>2 



< 



-P 



4Var[(£ 



e d X! ^(J/K',m(*>y) _ / <t>{y)ui >m {t,y) dy 



y£& 



> 



P 



e d ^iy) u i,m{^v) - j </>(y)ui, m (t,y) dy 

y<£eZ d 



> 



as e — > 0, and the theorem follows. 



□ 
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Chapter 5 



Existence of Invariant 
Stationary Distribution For 

Lily-pad Stirring 



In this chapter, we establish the existence of nontrivial stationary distribution of the particle 
system with lily-pad stirring as promised by Theorem 12.3.21 First, we rewrite (|2 . 12[) in the 
statement of Lemma 12.3.11 



We show that for sufficiently large c, the solution to (|5.1[) with initial condition uq = /, vq = 
9> f > 9 satisfies the following condition: 

(*) There are constants < D\ < d\ < d^ < T>% < 1, L, and T so that if Vq(x) € (D±, D2) 
for x £ [—L,L] then vt(x) € (^1,(^2) for x G [—3L,3L]. 

According to Chapter 9 of [Durrett 1995], this is a sufficient condition for the existence 
of nontrivial invariant stationary distribution for the particle system with sufficiently fast 
stirring, so Theorem 12.3.21 will follow once condition (*) is established. Recall that Theo- 
rem [3321 establishes that the diploid particle model without rapid stirring has a nontrivial 
stationary distribution if the birth rate A is sufficiently large. If one traces through the proof, 
however, one will find that "sufficiently large" in that argument means that A is larger than 
a number on the order of 6 100 , which is not too informative on where exactly the critical A 
for the phase transition is. On the other hand, one can get a far better idea of exactly for 
what A condition (*) holds. 

For this proof, we also establish condition (*) for sufficiently large c (recall that 
c = Xd), but here "sufficiently large" means that c is "only" larger than a number on the 
order of 100. We assume dimension d = 1; extension to d > 1 is straightforward. The proof 
consists of two parts: the first part, Chapter l5.1| establishes the existence of constants d\ 



du 
~di 

dv 



Au + (2c(l -u) + l)v-u 



Av + (c(u -v)- 2)v. 



(5.1) 
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and Di, and the second part, Chapter l5.2[ establishes the existence of constants di and D2; 
the second part will be easy once the first part has been established. 

Theorem 9.2 in [Durrett 1995] establishes condition (*) for a specific predator-prey 
system with phase space {0, 1, 2} at each site. The critical fact used in the proof is that the 
associated ODE system (i.e. the dynamical system that results when one has constant initial 
conditions) has only one interior equilibrium point and has a global Lyapunov function. The 
phase portrait of the ODE associated with (|5.1[) , however, shows that it has two interior 
equilibrium points, one of which is always a saddle point. See figure [5731 for two examples. 
Thus it does not look likely that the ODE system associated with (|5.ip has a readily iden- 
tifiable global Lyapunov function. The method we use to establish condition (*) for (|5.1|) 
is ad hoc, but does seem to apply to a wide variety of reaction-diffusion systems where the 
reaction part of the system is 2-dimensional (or even 3-dimensional), i.e. ^ = Au + f(u) 
where / : M 2 -> K 2 . 

As established in Lemma [273J] the PDE (|5.1|) is monotone in initial conditions that 
lie in TZ = {(u,v) : < u,v < l,u > v}. This fact is critical for the proof of existence of 
constants d\ and D\ in condition (*) above - it enables us to bound the initial condition 
Vo(x) below by a function, say Uq^x), both vq and Uq having values < D\ for x £ [— L,L], 
such that if the solution v t (x) to (|5.ip with initial condition 2lo( x ) satisfies the condition 

v T (x) > di \fx € [— 3L, 3L], 

then vt{x) > d\ Vx <G [— 3L, 3L] as well. We will also need results regarding the ODE 
associated with (JO): 

(2c(l -u) + l)v-u 

(c(u-v)-2)v. (5.2) 

The above ODE system is also monotone in initial conditions, since if the initial condition 
for the PDE system in (|5.1|) is constant in x, then the solution (tt t , vt) to (|5.ip also remains 
constant in x for all time and therefore satisfies the ODE system in (|5.2p . 



du 
~~dl 

dv 
II 



5.1 Lower Bounds: Existence of d\ and D\ in Condition 

(*) 

First we recall the definition of the region TZ from (|2.1ip : 

1Z = {(u,v) : < U, v < 1, u > v}. 

If the initial condition (uq, vq) lies in TZ, then so does the solution (u tl Vt). We will establish 
the existence of constants di, D%, L, and T using the nonlinear Trotter product formula 
(Proposition 15.5.2 from [Taylor 1996]): 

(u t ,v t )= lim U'^T^Y (f,g). (5.3) 

n—*oo \ / 
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Here the convergence occurs in the space BC^R), the space of functions whose first deriva- 
tives are bounded and continuous on R and extend continuously to the compactification R 
via the point at infinity; the norm used here is || • + ||^(-)||oo- In (|5.3[) . e sA (f,g) gives 
the (independent) evolution of / and g for time s according the heat equation 

du 

at = Au 

dv 

at = Av > 

and J- S (f, g) gives the pointwise evolution of (/, g) for time s according to the ODE in (|5.2|) . 
i.e. for all x, if (uq(x), Vo(x)) = (f(x),g(x)) then T s {f,g){x) = (u 8 (x),v s (x)) where 
(u(x),v(x)) evolves according to (|5.2p . Note that both e sA and T s are monotone in initial 
conditions, therefore so is e sA T & '. 

To establish the existence of constants D\ and d\ in condition (*), it suffices to 
show that for any initial condition (uq,vq) with vq dominating the function DiJ[_£,m(x), 
for sufficiently large T, (ut,vt) is such that vt dominates the function dijf[_3L,3£](^)- 

1 1 



-/ I -L-l -L -L + l L-l L L + l 

(a) The function h (b) The function /o 

Figure 5.1: The functions h and /q. 

Let H 2 (R) = {/ e i 2 (R) : ||/|| = J(l + f)\f(0\ 2 ^ < oo} denote the Sobolev 
space with parameter 2, where / denotes the Fourier transform of /. Equivalently, H 2 (M.) 
consists of i 2 -functions with L 2 -second derivatives. We first define /o € H 2 (R) that will be 
the "shape" of the initial conditions {uq,vq): 

IC 1. f (x) = 1 for x e [-L + l, L-l}; 

IC 2. f (x) = for x e (oo, -L-l)U[L + I, oo); 

IC 3. f Q (x) = h{x+L) for a; G [-L-Z,-L+Z] and/ (a;) = h(L-x) for x G [£-/,£+/], 
where /i G iJ 2 (R) is the following function: 



m-)= r vv^ 2 „?.;-, ■ (5.4) 
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In the above definition, the choice of L is arbitrary as long as L > I, but we will later on 
choose / small such that A/o is large in [— L — I, —L + 1] U [L — l,L + l], We call the intervals 
\—L — I, —L + 1} and [L — I, L + I] the "transition regions" . We observe that h is continuous 
at x = 0, with 



h"(x) 



p if -I < x < 
if < x < I 



so the graph of h in the plane is symmetric about the point (0, i) and also, 



|A/o| < ^ (5.5) 



everywhere. 

We pick the initial condition to be uq = ao/o,Vo = £>o/o with (ao/o(0), &o/o(0)) = 
(ao,&o) € TZo, where TZq is the region for the top tip of the line segment O(ao,6o) to be 
defined later in (|5.19|) . See figure E21 for an illustration of IZo and the line segment O(ao, bo). 
We will define a family of parallel piecewise-linear curves ABC(u , vq) (see figure that 
satisfy the following requirements: 

ABC 1. ABC(u ,v ) lies in TZ ; 

ABC 2. ABC(uq,Vq) passes through the points A, B, C, and (u ,vq); 
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ABC 3. A lies on the line u — v, B lies on the line u = v + 0.1, and C lies on the 
line u = 1; 



ABC 4. ABC(u ,v ) = AB(uq,v ) U BC(u a , vq), where line segments AB(uq,vq) 
connects A and B, and BC{uq,vq) connects B and C; 

ABC 5. A_B(uo,«o) makes an angle of —6 (0 < 8 < arctan(0.05)) with the positive 
u-axis and BC(uq, vq) is a horizontal line segment. 

We will establish the following: 

Proposition 5.1.1. If c is sufficiently large and (ao,&o) & {( u > v ) € ^ : 0-55 < w < 0.8} ; 
iften /or sufficiently small s, we have 

e sA F s (~aofv,b a fo) > {o.f.Xf.), 

where fo is defined in (IC 1-3) on vaae \33\ > means that > holds in each component, and 
a s , b s , and f s satisfy the following conditions: 



1. a s and b s are constants depending on s, such that the curve ABC(a s , b s ) lies above the 
curve ABC{aQ,bo), and the vertical distance separating them is at least 82s. In par- 
ticular, since ABC(ao,bo) lies above the horizontal line v = 0.5, so does ABC'(a s ,b s ). 

2. 



fs(x) 



' 1, x e [-L + I - 6is,L - I + S 1 s] 

h(x + L + Sis), x e [— L — I — 5±s, — L + I — 6is] , , 

h(L + Sis - x), x e \L-l + 8is,L + l + 61s] '' ° 

,0, x G (00, — L — I — Sis] U [L + I + Sis, 00) 



i.e. f s is fo with each of the two transition regions is translated by Sis away from the 
origin. 

3. Si and 82 are positive constants independent of s and (ao,oo)- 

The proof of the above proposition requires a few lemmas and will be deferred until 
the end of Chapter 15. 1.21 Proposition 15 . 1 . ll states that e sA jF s (ao/o, &0/0) is bounded below 
by (a s f s ,b s f s ), where (a„,b s ) moves 82S above ABC(ao,bo). Furthermore, by (|5.6[) . the 
region where the value of (a s f s ,b s f s ) (and hence e sA J 7S (aofo,bofoj) is equal to or above 
(a s ,b B ) has expanded by 81s, both to the left and to the right, while the transition regions 
of (a s f Sl b s fs) are shifted left or right by Sis but maintains exactly the same profile as in 
the initial condition. Thus by the monotonicity of e sA F s , we can iterate e sA JF s enough 
times and obtain information about the evolution of the PDE (|5.1[) for large time. From the 
construction of the piecewise linear curves ABC, requirement (5) implies that B(0.8, 0.8) on 
AB(0.8, 0.8) has w-coordinate > 0.75. Therefore Corollarv l5 . 1 . 2l below is an easy consequence 
of Proposition l5~l. II Let 

ir v (u ,v ) = v , (5.7) 

and L^J = max{z £Z:z< x}. 
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Corollary 5.1.2. If c and T are sufficiently large, and vq(x) > 0.55 for x G [— L + 1, L — I] 

then 

n v ((e sA T s ) lT/sl (uo,vo)(x)) > 0.7 
for x G [— 3L, 3L] and sufficiently small s. 

In other words, the constants D\ and d\ in condition (*) are picked to be D\ = 0.55 
and d\ = 0.7. Note that we restrict vo(x) = bofo(x) to be > 0.55 for x G [—L + l, L — l] in the 
above corollary because Prop 15. lTTI only works for (do, bo) € {(u,v) G 1Z : 0.55 < v < 0.8}. 
Also note that the "L" in condition (*) is picked to be L — I. 

5.1.1 Analysis of the ODE (IQl) 

We first characterize the phase portrait of the ODE. We carry this out for sufficiently large 
c. See figure [S~3l for phase portraits with c = 5 and c — 25. Define 



fy(w, i>) = (rji(u, v), rj2{u, v)) — ((2c(l — u) + \)v — u, (c(u — v) — 2)v), (5-8) 

such that the solution to the ODE (|5.2p . (u s ,v s ) = ^(uo^vq), flows along the vector field 
77. Define the curves 71, 72, and 73: 



71 = :«G [0,l],t> = — — , (5.9) 

1^ f + 2c — zcu J 

72 = |(«,«) :«£ [0,1],« = «- (5.10) 

73 = {(u,«):ue[0,l] s « = «}. (5.11) 

We have 771 = on 71 and 772 = on 72. An easy calculation shows that for all c, (0, 0) and 
(1,1) pass through 71. We observe that 771 is a linear function in u for fixed v, so 771 > to 
the left of 71 and 771 < to the right of 71 . By similar reasoning, we also have 772 < to the 
left of 72, while 772 > to its right. The two intersection points of 71 and 72, 



P_ = 




are the only equilibrium points of 77 in the interior of 1Z, with O — (0, 0) G dlZ being the 
third equilibrium point. Elementary computation shows that O and P + are stable, but P_ 
is a saddle point, thus one would expect any point that lies significantly above P_ to flow 
toward P + under 77. Elementary calculations also show the following: 

(5.12) 





•(1,1) 


as 


c - 


-> 00 


P- -t 


(0,0) 


as 


c - 


-> 00 


p_,„ 

P- V 


— > 00 


as 


c - 


-> 00 



where and P-^ denote the u- and u- coordinates of P_, respectively. 
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(a) c = 5 (b) c = 25 

Figure 5.3: Phase space of the ODE 

We will need some crude estimates of r\\ and 772 . First of all, since u — v > 
everywhere in 1Z, we have 

772 > -2v. (5.13) 
If the point (u,v) is at least S to the right of 72, then since T]2(u,v) = on 72, 

172(11, v) > Scv, (5-14) 
Similar reasoning shows that if the point (u, v) is at least 5 to the left of 71, then 

r]i(u,v) > 2Scv. (5.15) 
Now the horizontal distance between "fi and 73 at a fixed v is 

(l + 2c)v 

d(v) = — v. 

w l + 2cv 

Simple calculations show that 

2c(2cv 2 + 2v-l) 4c(l + 2c) 
d (v) = 7- — rr^ and d (v) = — — — -77. 

v ' (l + 2cw) 2 v ; (l + 2ci;) 3 

Notice that d"(v) < if v, c > 0, so for v G [0, 1], d(v) is a strictly concave function, and 

1 



v= — (vT+2^-1) 

is the unique point where d(v) attains its maximum for v G [0, 1]. We observe that 

v — > as c — > 00. (5.16) 
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If c is sufficiently large such that the horizontal line v = e lies above the line v = v but 
below the line v = 0.8, then for v €E [e, 0.8], the minimum of d(v) occurs at v = 0.8, and 

as c — > oo. This shows that for arbitrary e > and sufficiently large c, the minimum 
horizontal distance between 71 and 73 for v S [e, 0.8], is larger than 0.19. This fact will be 
needed a bit later on in the proof of Lemma f5. 1.31 We also observe that since d(v) is strictly 
concave for v <E [0, 1], the curve 71 written as u = u(v) is also strictly concave for v S [0, 1]; 
then (|5. 16(1 and the fact 

l + 2c- y/T+2c 

u(v) = ► 1 as c — > 00 

v ; 2c 

imply that 

71 ^ {(«,«) :i) = 0,ue [0, 1]} U {(u, v) : u = 1, v € [0, 1]} as c — ► 00. (5.18) 

This finishes the characterization of the phase portrait of ODE (|5.2[) . These facts, which 
are admittedly tedious and not much fun to establish, will all be required later in the proof 
of Lemmas 15.1.31 and 15.1.51 

We define the region (see figure 15. 2[) 

TZ Q = Uu, v)e7Z:u< ^ + 2c ^ _ .04, 0.55 <v< 0.8}, (5.19) 
1 + 2cv 

Recall that 71 defined in (15.91) is the curve v — -,,„"„ — or u — , ~[^ v , and therefore 
the region 1Z Q lies at least 0.04 to the left of the curve 71. By (ABC 5), the line segment 
AB forms a small negative angle with the positive u-axis, so by requiring v > 0.55 in the 
definition of TZq, we can be sure that all of ABC(ao, bo) lies above the horizontal line v = 0.5 
if (do, fro) S 72-0 ■ F° r an Y point (u,v) £ TZq, we have u < 2v and (u,v) lies at least 0.04 
to the left of 71. Since our initial condition has form (ao/o, &o/o)> the set of values in each 
"transition region" 

{(ao/oO), b fo(x)) :xe[L-l,L + I}}, 

forms a line segment with endpoints O and (ao, 60) in the (u, u)-plane. We require that the 
tip of this line segment (ao, bo) lies in the region TZo- We do not need to worry about the case 
where the initial condition for the PDE (|5.1[) is such that (ao, bo) € 1Z ("1 {0.55 < v < 0.8} 
lies to the right of TZq. If we want to establish condition (*) for that initial condition, then 
by the monotonicity of the PDE (|5.ip . it is sufficient to pick a' a < a such that (a , bo) £ TZo 
and prove condition (*) for the initial condition (a /o,&o/o)- Therefore we only consider 

(a , 6 ) in ^o- 

Assuming (ao, 60) lies in TZo, a part of the line segment O(ao, 60) still lies below the 
horizontal line v — 0.55. To study the evolution of the whole line segment under J- v , we will 
a bit later consider two cases: 1. e < v < 0.8, and 2. < v < e, where we will pick e = 0.24 
in Chapter 15.1.21 We will construct piecewise linear curves ABC(vo, vq), vq G [0.55,0.8], 
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with A = (vo,vq), B, and C satisfying the requirements laid down in (ABC 1-5) on page 1331 
in the proof of the following two lemmas. See figures [5T5l and [5~7l for an illustration of each 
lemma. 

Lemma 5.1.3. (Case 1) If (a ,b ) lies on AB(vo,vq) = AB(ao,bo) with a — b < 0.09 
and 0.55 < bo < 0.8, then for sufficiently small s, there exist a s and b s with b s > 0.5, and a 
positive number K independent of s such that AB(a s .b s ) — AB (vo,vq) and 

F°{aa , ab a ) > ((1 + Ks)aa s , (1 + Ks)ab s ^j (5.20) 

for all a £ [0, 1]. Moreover, the constant K can be chosen to be arbitrarily large if c is also 
allowed to be arbitrarily large. 

Remark 5.1.4. Using some easy geometric considerations, one can say the following: if e 
is fixed and c is allowed to be arbitrarily large, then there exists an arbitrarily large positive 
number K depending on e but independent of s, such that if a £ [■£-, 1] then 

((1 + Ks)aa s , (1 + ks)ab s ) - (aa s ,ab s ) > (j^Ks, Ks^j , (5.21) 

and if a £ [0, 4-) then 

((1 + Ks)aa s , (1 + Ks)ab s ) - (aa s ,ab s ) > (0, 0). (5.22) 



Lemma 5.1.5. (Case 2) If (a ,b ) lies on ABC(v ,v ) = ABC(ao,b ) with 0.08 + b < 
a ° ^ itIct — an ^ < °o < 0.8, then for sufficiently small s, there exists a positive 
number K such that, if a £ [g^, 1], then 

T^{aa ,ab Q ) - (aa ,ab ) > {^-Ks, Ks \ , (5.23) 

and if a £ [0, then 

J-^(aao, ab ) — (aa ,ab ) > (— 2aa s, — 2ab^s). (5-24) 

Moreover, the constant K can be chosen to be arbitrarily large if c is also allowed to be 
arbitrarily large. 

In case 1 above (Lemma 15.1. 3p . (a s ,b s ) changes with s, but in case 2 
(Lemma I5.1.5[) . (a s ,b s ) remains fixed and equal to (ao,&o)j thus (a s ,b s ) is not explicitly 
defined. As will be seen later on, I is picked small so that the lower part of the "transition 
region" (of (aofo, b Q f )) moves up at a sufficiently large speed under the heat kernel to cancel 
out the downward movement as described in (|5.24[) . But the heat kernel pushes down the 
top part of the "transition region" , so K (and thus c) is picked large to cancel out that 
effect. And finally s is picked small so that the movement caused by J-^ is small. 

In case 1, we assume (ao, bo), the top tip of the line segment formed by the "transition 
region", lies to the left of the line u — v + 0.09, while in case 2, we assume that (ao,6o) 
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lies to the right of u = v + 0.08. There is a thin strip, i.e. 0.08 < u — v < 0.09, where 
both cases apply, so we can apply either case 1 or case 2 there. Let (uq,vq) be a point 
on the line segment O(cto,&o)- Intuitively, we would like to view "progress" as an increase 
in u-coordinate, which is measured by ir v ( Ti (uq , vq ) — (uo,Vq)). In case 1, however, it is 
not always possible for the ^-coordinate to increase. So instead, we measure progress with 
respect to the family of parallel lines AB, each of which makes a small negative angle 
with the positive it-axis, and thus allowing the ^-coordinate to decrease slightly while still 
making "progress" with respect to AB. More specifically, we compare J-^uq^vq) not with 
(uq,Vo) = (^-ao, f^M) but with (u s ,v s ) — (f^a s , ^b s ). We show that with respect to the 
lines AB, the entire line segment makes progress with respect to the lines AB when moving 
under rj. The u-coordinate actually increases very rapidly, so we move very quickly into 
where case 2 applies. In case 2, we compare ^(uq^vq) directly with (uq,vq), and show 
that the part of the line segment with w-coordinate > e makes progress, but the part of the 
line segment with ^-coordinate < e actually makes small negative progress. This negative 
progress will be compensated by positive progress made under evolution according to the 
heat kernel (to be shown in Chapter I5.1.2[) . 

Proof of Lemma 15.1.31 We first define 

Tlx = {(u,v) eTl: u -v G [0,0.1], v G [e, 0.8]}, (5.25) 
Tl\ = {(u,v) G Ki : u-v G [0,0.09]}. (5.26) 
TZ 3 = { (u, v) G K: u-v G [0,0.1], u<2v,ve [0,e)}. (5.27) 

Later in the proof of Lemma 15-1 .51 we will also define the follow three regions, which we 
include here for easy reference (See figure [5^4"]) . 

( 1 + 2c)v 

Tl 2 = {(u, v) £ 71 : v + 0.02 < u < n > - 0.04, v 6 [e, 0.8]}, 

1 + 2cv 

(1 + 2c)v 

-RL = Uu,v) G Tl 2 : u + 0.08 < u < ±— — '— - 0.04, v > 0.55}, 

1 + 2cv 

K 3 = {(u,v)eTZ:u<2v,ve[0,e)}. 

Notice that Tl' x cR^^CRj, and TZ' 3 cn 3 . 

To study the evolution of T n of a line segment OP that passes through the origin O 
and has its top tip P in the region ~R! X fTTS-Oj we will define a new vector field £ = (£i, £2) for 
(it, v) G U 7^.3, such that £1 < r\\ and £2 < V2 everywhere in IZi U 1Z' 3 , which means that 

J%(<u,v)<J*(u,v) (5.28) 

in both u- and v- coordinates, for small s and all (u, v) G TZ^UlZ^. Notice that TZ[ is required 
to stay a finite distance left of the line u — v = 0.1, the right edge of the parallelogram IZi; 
this is such that we can still control how much (it, v) G TZ[ moves under !F$, even if it leaves 
IZi and enters the strip 7£i\7?4. Also, we define £ in TZi U TZ' 3 because this is the region 
where the line segments OP lie. We only consider the region 1Z' 3 for (u, v) close to the origin, 
rather than the region 

{(u,v) £K:u-vE [0,0.1],UG [0,e)}, 
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(a) 7£i and TZ-i (b) 7?-2 and 7^3 

Figure 5.4: Various regions. 



since top tip of the line segment b s will be > 0.5, which implies that u < 2v if (it, v) £ OP. 
Step 1: Defining £. We first define £ on the diagonal line 

73 n {(it, u) : w < 0.8} = {(it, v) : u = v, < v < 0.8}, 

where 73 is defined in (|5.11[) . Let 9q < \ arctan(0.05) be a small angle such that a line 
passing through (t'Oji'o) with vq > 0.55 and making an angle of — 28$ with the positive 
it-axis intersects the vertical line it — 1 above the horizontal line v = 0.5. Define 



and pick F± to be large but 



S = 0.1, 



F l < Sc. 



We also define 



£(0.8,0.8) = (0.8Fi,0.8(-2)), 



(5.29) 



(5.30) 



(5.31) 



where F\ is large enough such that £(0.8, 0.8) makes an angle of —6\ with the positive u-axis, 
and < 61 < 6q. This can be done for sufficiently large c. We define 

= ^£(0.8,0.8) = (vF u -2v), 

for v £ [0, 0.8]. This defines £ on the line segment 73 n (IZi U TZ' 3 ). 

Finally, along straight lines that make angles of — 9\ with the positive it-axis, denoted 
A'B'(v,v), we define £ to be equal to £(v,v), i.e. for all (u',v') £ A'B'(v,v), we define 

£(u',v') = 
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u 



Figure 5.5: Illustration of Lemma [5.1. 31 (Case 1) 



Here A' = (v,v) is the point where A'B'(v,v) intersects the line u = v and £ has already 
been defined, and B' is the intersection point of A'B'(v, v) and the right/bottom edge of 
the 4-gon IZi U 1Z' 3 , i.e. either the line u = v + 0.1 or the line u = 2v. Thus we have defined 
£ on all points in IZi U 1Z' 3 . To summarize, £ in 1Z\ is defined in a way such that: 

1. On the line u = v where < v < 0.8, 



such that £ makes a small angle of —9\ with the positive it- axis. 

2. £ is constant along lines that start at a point on the line u = v, and make angles of 
—9i with the positive u-axis, where 



e = «(-2)), 



(5.32) 



9i < # < - arctan(0.05). 



(5.33) 
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Step 2: Verifying £1 < rji. We divide into two sub-cases: 

1. Tli: e < v < 0.8; 

2. 72 3 : < v < e. 

We first deal with sub-case 1. By the discussion following (|5.17p . the minimum 
horizontal distance between 71 and 73, defined in (|5.9[) and (|5.11[) . is at least 0.19 for 
v E [e,0.8] and sufficiently large c. Therefore the region 72 1 is more than S = 0.1 left of 
71 n {(u, v) : e < v < 0.8}. Thus by (I5T5]) . 

T)! > 2Scv (5.34) 

in 72.1. On the line segment 73 n 72i = {(u,v) : u — v,v £ [e, 0.8]}, £ is defined by (|5.32[) . 
Condition (|5.30|) then implies 

£1 = vF\ < dev. 

Thus (|5.34p shows that £1 < 771 on the line segment 73 (~l 72i. 

We also need to verify that £1 < 771 everywhere in 72i. For that, recall that £ 
in 72i is constant along line segments A'B'(v,v), each of which makes an angle of —Q\ 
with the positive w-axis, so we estimate how much the w-coordinate can decrease along 
the line segments A'B' , to make sure that £ < r) even on the line u = v + 0.1. We 
observe the following: since tanf?i < 0.05, the amount by which the w-coordinate decreases, 
from the point A'(vo,vq) on the line u — v to the point B' on the line u = v + 0.1, is 
0.1tan6*i < I < where we will pick e = 0.24 in Chapter 15.1.21 Thus even for (uq,vq) 
lying on the line u = v + 0.1, we still have A'B'(uq, vq) = A'B'(v\, v\) (i.e. [v\, v{) lies on 
the line u — v) for some V\ with v\ < 2v$. Because £ is constant along A' B'(uq, Vq), we have 

£i(itO)Uo) = V1F1 < 2v a Fi < 2Scv , 

where we use the requirement (I5.30|) in the last inequality. Hence for all (ug, vq) € 72i, 

€i(u ,v ) < 77i(M ,fo) 

by UEMD- 

Now we deal with sub-case 2. For v < e < \ and u < 2v, we have 

771 = (2c(l -u) + l)v-u> (2c(l - 2v) + l)v - 2t; 

= (2c(l - 2v) - l)v > cv (5.35) 

if c is sufficiently large. This estimate applies to both Case 1 (this lemma) and a bit later 
on Case 2 (Lemma 15. 1 .5[) . and shows that on 73 n 72 3 = {(u, v) : u = v, v € [0, e)}, 

£1 = vFi < 5cv = O.lcv < rji, 

where we use (|5.29|) . (|5.30|) . and (|5.35|) in the second, third, and fourth steps, respec- 
tively. For the rest of 72 3 , we make the observation that the ^-coordinate of any point 
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on A'B'(vi,vi) is larger than the ^-coordinate v-i of the intersection point of A'B'(vi, v\) 
and the line u = 2v, which we obtain by solving v — v± = — tan0i(2i> — v±), i.e. V2 — 
mW£k v i > f • Therefore for any (u ,v ) £ WW(v uVl ), 

£i(u ,v ) = viFi < OAcvi < 0.2cv < 771, 

from which we conclude that £1 < rji in TZ' 3 . 

Step 3: Verifying £2 < 772 • This is considerably easier than verifying £1 < r]\. 
From ([Q2]) . £ 2 = -2v on the line segment 73 n (Ki U K' 3 ), so £ 2 < 172 by l(IU3]) . Along 
A' B'(vi,vi), the w-coordinate decreases. So for any (u ,v ) £ A'S'(ux) 

6(«o,«o) = -2«i < -2u < 

Thus £ 2 < V2 in KiU^. 

Step 4: Defining a s , b s , and AB. The vector field £ is defined such that any 
point (uq,vq) £ 72i U 1Z' 3 moves under £ at a constant speed (linear in vo) along the line 
A' B'(uq,vo). Thus any line segment OP lying in 1Z± U7Z' 3 remains a line segment (i.e. does 
not become a curve) under the flow £, and if (u\, V\) and (tta, V2) are two points on such a line 
segment, then the ratio |^|^'"^ | remains constant. We define AB(vi,v\), < v\ < 0.8, 
to be the line segment that makes an angle of — 26* with the positive w-axis and connects 
points A = (vi,Vi) £ 73 and B, with B lying on the right/bottom boundary of the 4-gon 
IZi U 1Z' 3 . Recall from (|5.33[) that Oq is a small angle and A'B'(ao,bo) makes an angle of 
—61 with the positive u-axis, where < Q\ < 9o- Thus the part of A'B'(ao, bo) to the right 
of the point (ao,&o) lies strictly above AB(ao,bo), and the angle between A'B'(ao,bo) and 
AB(ao, bo) is at least 9o by the choice of #0 and 9\ in (|5.33|) . Also, we define 

(a s ,b s ) = AB{ao,b ) n OJ|(a ,6o), 

where (ao, 60) G Ho H 7?^ is the top tip of any line segment that we consider for this lemma. 
We collect various facts for later use: 

1. J-£ moves the point (ao, bo) to the right. 

2. The part of A'B'(ao, bo) to the right of the point (ao, bo) lies strictly above AB(ao, bo). 

3. (a s , b s ) lies on AB(ao, bo). 

4. J|(a , b ) lies on A'B'(a , b Q ) 

5. (a s ,b s ) and f^(ao,bo) both lie on O.F|(ao, bo). 

The above facts imply that b s is a lower bound for n v (J-^(ao, bo)). 

We now estimate the speed at which ,F|(ao, bo) separates from (a s , b s ). First of all, 
since A'B'(ao,bo) makes a negative angle with the positive u-axis, the intersection point 
(vi,vi) of A'B'{ao, bo) and the line u = v must lie above (ao,bo), i.e. 

vi > b . 
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This means that 

|£(ao,M = \^i,vi)\ = 1(^^,-2^)1 > boyjFf + 2 2 . 

Let 

^2(60) = b ^F 2 + 2 2 . 

Since A'S'(oo,6o) and A£?(ao,&o) make angles of — #1 and — 2#o with the positive 
it-axis, respectively, where < B\ < 9q, the angle 62 between A'B'(ao,bo) and AB(ao,ba) 
is larger than 8q. Let a = sF2(&o) be the distance between (ao,6o) and JF|'(ao,6o), and 
/3 be the distance between (ao,&o) and (a s ,6 s ), then the Euclidean distance 7 between 
!F^(ao,bo) and (a s ,6 s ) (the thick line in figure 151))) is 7 = cos 6*2, which 

attains the minimum usin#2 when f3 — acos#2, therefore 7 > si^C^o) sin#o- Since s is 
small and (ao,&o) lies in IZo n 7?4 (in particular, to the left of the line u — v = 0.09), 
.F|(ao,&o) lies in {(«, u) eR:u-«6 [0,0.1],?; > 0.5}, thus the smallest angle between 
OJ-j!(ao,bo) (portion of which is (a s ,b s )J-^(ao,bo)) and the positive w-axis is greater than 
arctan ^| > -|. Therefore the vertical distance between .F|(ao,&o) an d (a>s,b s ) is at least 
sF 2 (&o) sin 8q sin ^. Similarly, .F|(ao, &o) lies in 7£> so the largest angle between CTvf (a , b ) 
and the positive u-axis is less than ^ , hence the horizontal distance between these two points 
is at least sF2(b ) sin# cos j, which is larger than sF2(6q) sin0 o cos 



u = v 




Figure 5.6: Another illustration of Lemma [5.1.31 (Case 1) 
More precisely, 

/ 7T 7T\ 

fl(a ,b )~(a S7 b s ) > ^sF 2 (6 ) sin0 o cos -, sF 2 (6 ) sin0 o sin -J 
= Q&o^i 2 + 2 2 sin O , ibo /f 2 + 2 2 sin O 
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Since b s < bo, the above inequality implies 



Fl(a ,b ) - (a 3 ,b s ) > \^b s F? + 2* sin 9 Q ,^b s ^/F? + 22 sin Q j. (5.36) 

By the choice of a small 8q in (|5.33|) . the entire line segment AB(a,Q,bo) lies above the 
horizontal line v — 0.5 if (ao,&o) € TZo H 7?^. Thus (a s ,6 s ) lies above the horizontal line 
i) = 0.5, which means that a s < 2b s . Thus (|5.36p implies 



Fl(a ,b )- KA) > [ja s jF^ + 2 2 8hxd Q ,-b s JF^+2 2 8hx9 ). (5.37) 



If we define 



K = -7\ F l +2 2 sin0 o , 



then (|5T28]) and ([5T37f verifies condition ([5T20f for the point (a ,b ). We recall from (|OD|) 
that Fi can be chosen to be arbitrarily large if we allow c to be large. Therefore K can also 
be chosen to be arbitrarily large if 8q is fixed. 

For any other point (aao,abo) on the line segment O(ao,bo), a £ [0,1], notice 
that from ()5.32|) . £(aao,a&o) = a£(ao,6o), i-e. ^ is linear on the line O(ao,6o). Also, the 
entire line segment O(ao, &o) li es m T^i U ^3- Thus by (|5.28p . condition (|5.20p holds for all 
ae[0,l]. □ 

Proof of Lemma 15.1.51 We define 

(1 + 2c)v 

K 2 = {(u,v)eTZ:v + 0.02<u< y - 0.04. v £ [e, 0.8]}, (5.38) 

1 + 2cv 

Tlo = {(u, v)eTZ 2 :v + 0.08 < u < ^ + 2c ^ _ o.04, v > 0.55}, (5.39) 

1 + 2cv 

K 3 = {{u, v) e K : u < 2v, v E [0, e)}. (5.40) 

For this lemma, the region 1Z' 2 is the region for P, the top tip of the line segment OP that 
connects the origin O and the point P. Since we pick e = 0.24 later in Chapter I5.1.2[ any 
line segment OP, with P £ 1Z' 2 , lies entirely in 1Z 2 U 7?-3 . We follow the same steps as in the 
proof of Lemma T5. 1.31 fCase 1). 

Step 1: Defining £. As in the the proof of Case 1, we will define a vector field £, 
such that Ci < r/i and ( 2 < rj 2 everywhere in 1Z 2 U R3. First, we define 

C(1 J 1) = (F 3 , J F 3 ) > 

where we pick F3 large but with 

< F 3 < 0.01c. (5.41) 

It is convenient to define £ at the point (1,1), even though this point is not even in 1Z 2 U R3. 
For (u,v) £ 1Z 2 , we define 

C(u,v) = (uF 3 ,vF 3 ), (5.42) 
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And for (u, v) eK 3 , 

C(u,«) = (u(-2),t>(-2)). (5.43) 

Notice that ( is discontinuous across the horizontal line v = e. 

Step 2: Verifying £ < 77. The region 7?-2 is at least 0.01 to the right of 72 (the 
line u = v + 2/c) for sufficiently large c. By (|5.14[) . we have 

772 > O.Olcw (5.44) 

for all points in TZ2- The region IZ2 is also at least 0.04 left of 71, where 71 is the curve of 
u = ^Xlfv ■ Thus b y dnHSj) , we have 

r;i > 0.08cw > 0.08ce = 0.08c(0.24) > 0.01c. (5.45) 
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for all points in K 2 . By (j5"Hj) and ([51^) . 



C(u,v) < (O.Olcu, O.OIot) < (0.01c, O.Olcu), 
which implies, by (|5.44p and (|5.45[) . 

C(u,v) < r)(u,v) 

for (u, n) £ 

Recall from (|5 . 18[) that the curve 71 approaches the degenerate curve 

{(u, v) : v = 0, u e [0, 1]} U {(u, v) : u = 1, v £ [0, 1]} 

as c — > 00. Therefore IZ3 stays to the left of 71 if c is sufficiently large, and by the discussion 
below (|5.11[) regarding the sign of 771, r\\ > for (u,v) 6 IZ3. The definition of £ in (|5.43|) 
says that Ci < f° r ( u , v ) G 72-3 ; therefore 

d(u,v) < rji(u,v) 

for (it, v) G 72.3. Furthermore, (|5.13[) implies that £2(1*, u) < 772(1^ w). Thus for all (it, v) € 72-3. 

C(u,w) < 77(«,u). 

Step 3: Defining BC. We define BC(vq,vo) to be the horizontal line segment 
u = «i starting at point B(vq, vq) = (v± +0.1, vi) on the line u = v + 0.1 and ending on the 
vertical line u — 1; notice that (vo,vq) itself does not lie on the line segment BC(vq, Vq). 
This definition of BC(vq, vq) means that B(v$, vq) = (v% + 0.1, v\) is the right end point of 
the line segment AB(vq, vq), which was defined in Step 4 of the proof of Lemma [5.1.31 We 
also define 

K = Y- (5 - 46) 

Notice that K can be made arbitrarily large since F3 (picked in (|5.41jl ) is allowed to be 
arbitrarily large. 

The vector field £ is defined such that any point (u, v) € IZ2 moves in the direction 
0(u,v), i.e. C is a dilation for points in 722. But any point (u,v) € 723 moves in the 
direction of (u, v)0, i.e. C is a contraction for points in 723. Thus any line segment OP with 
P G 72^ immediately splits into two line segments under £; the two line segments, however, 
lie on the same straight line through the origin O. For the top tip of the line segment 
P = (ao, bo) € 72 2 , we have 

•^(ao, b Q ) - (oo, 60) > (sa F 3 , sb F 3 ) , (5-47) 

since the fact that both u- and v- coordinates increases under £ in 1Z 2 implies that sa F 3 
and sboF 3 are lower bounds for the increase in u- and v- coordinates, respectively. Since 
6 > 0.5 if (a , 60) S 72s, $5M§ and ([ST?) implies that 

F s c {a , b ) - (a ,b ) > (s^b F s ,sb F 3 J > (j^ Ks > Ks ^j ■ (5-48) 
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This verifies (|5.23[) for P = (do, bo) € 7Z' 2 . For any other point (aao, abo) on OP that 
lies in IZ2 (i.e. a S [^-,1]), linearity in the definition Q(aao,abo) = aC(ao,&o) in (|5.42|) 
implies (15T2li|) . 

The verification of (|5.24[) for points in IZ3 is similar. Recall from (|5.43[) the definition 
of ( in IZ3: 

C(«,«) = (-2u,-2«). 

Let P = (ao,6o) G 72-2 an d a G [0, ^-)- Then both aao and a&o decrease under £, initially 
at speed 2aao and 2ab , respectively. The speed of decrease immediately becomes smaller 
than 2aag and 2abg (respectively) after the initial movement. Thus 2aa and 2abg are 
upper bounds of the speed of decrease: 

J-^(aao, abo) — (aao, abo) > (— 2aaos, —2abos) , 

as required by (|5.24|) . □ 

To summarize the results in Lemmas 15.1.31 (Case 1) and 15.1.51 (Case 2), if the top 

tip of the line segment OP at time 0, P = (ao,&o) with bo > 0.55, lies to the left of the 

line u — v + 0.09, then we use Case 1 to define (a s ,b s ) £ AB(ao,bo) and £ < 77 where 

(a s ,6 s ) is below but to the right of (do, bo), such that £ moves (ao,&o) a t an arbitrarily 

large speed below and to the right of (ao,&o), but above (a s ,b s ). Once the top tip of the 

line segment has moved to the right of the line u = v + 0.08 but to the left of the curve 

u = ^2ct" — 0-04, (or it lies between those two at time to start with), then using Case 2, 

where (a s , b s ) = (ao, bo), we define £ < i] such that £ moves (do, bo) above and to the right 

> 

of (ao, bo), in fact, along the same direction as O(ao, 60), again at an arbitrarily large speed. 
Finally, if (ao,6o) lies to the right of the curve u — — 0.04, then we move the initial 

condition to the left of this curve and apply Case 2. 



5.1.2 Analysis of the PDE (I5TTT) 

Now we use the results obtained in the previous section about the evolution of the ODE (|5.2[) . 
together with some results on the heat equation, to study the evolution of the PDE (|5.1[) . 
First, we need to characterize how values in the transition region evolve according to the 
heat equation. We will establish two technical lemmas to that end. 

Lemma 5.1.6. If I is fixed and f = fo is as defined in (IC 1-3) on vaae \33\, then for 

^(-L-l-s-.-L-^UiL + ^L + l + s) 
and s small, we have 

e^f{x) > f(x) + 4. (5.49) 
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Proof. First, we shift / right by L such that f(x) = h(x) for x <E [—1, 1] and f(x) — h(2L — x) 
for x e [2L - I, 2L + 1]. Thus Af(x) = Ah(x) for x G (-1, I) and Af(x) = Ah(2L - x) for 
x e (2L - I, 2L + 1), where 

Ah(x) = 



We make the following observation to aid our computation: if u = e sA h gives the evolution 
of the heat equation with initial condition h, then Au = A(e sA h) gives the evolution of the 
heat equation with initial condition Ah, i.e. 

A(e sA h) = e sA (Ah). (5.51) 

Define 



0. 



l 



x < -I 
-I < x < 
< x < I 



X 



> I 



(5.50) 



k(x) = Af(x), 



then 



Ah(x), x £ (-1,1) 

k(x)={ Ah(2L-x), x e(2L-l,2L + 1) 
otherwise 



(5.52) 



By equation (5.5.10) in [Taylor 1996], the solution of the heat equation can be expressed in 
terms of an integral. 



e sA k(x) = 



1 



'47TS 



e 4s k(x — y)dy. 



(5.53) 



Using the above formula and the expression of k in (|5.52[) . we can estimate e sA k(x) for 
x G (— 1 1, —I] and s small: 



e sA k(x) 



> 



> 



> 



1 

P 

1 

P 

1 

P 



P \J-x-l V^TTS 



dy 



-x+l 



1 Aits 



--dy ■ 



-x+2L 



e 



-X-1+2L V 47TS 



: dy 



-X+Z+2L e _^_ 



do 



x+2L 

e 4= 



'47TS 



-x-i V47TS 

oo 



1 



e *<* 

-X V47TS 

oo 



-x+Z -3L- 

e 4= 



'47TS 



dy 



-x+2L _*C 

e 4 S 



-i+2L V47TS 



dy 



x-i V 47TS 

DO j 

|x|-i V 47TS 



e 4s dy — 3 
e 4s dy — 3 



1 



-x V 47TS 
1 



'47TS 



e 4s dy 
e~ T °dy 



(5.54) 
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where in the last step we use the fact that x G (— Z] implies |x| = —x > I. We can 

take s to be sufficiently small such that J £ ~/|= < 10 _5 /3, then with a substitution of 
variable in the first integral in (|5.54[) , we obtain 

e sA k(x) > If/ -^e"Tdy- 10- 5 ) , 



1 (\ /-(l^l-O/v^ 1 2 \ 



If a; G (— Z — s, — I], then |x| — I < s < yfs if s < 1, and (|5.55p implies 



1/1 „_ 8 Z" 1 1 _Z , \ J 



68 ^^U" 10 io ^ e "^J > 6P- ^ 
On the other hand, for x G (— /, — 2S0) an< ^ s sman J we a l so bave 

e sA k(x) > ± (5.57) 

since for x G (—1,1), k(x) = Ah(x) is a step function with discontinuity at 0, where Ah is 
given in (|5.50|) . 

Estimates (I5.56P and (|5.57[) on the behaviour of k under the heat kernel implies that 
for s small and x G (— I — s, — 2S0), 

deSA / s {X) = ( A ( eSA /))W = (e sA (A/))(*) > 

where we use (|5.51[) in the second equality. This establishes (I5.49[) for x € (— L — I — s, —L — 
2^q)- Verification of (|5.49[) for x G (L + i + / + s) is similar. □ 



Lemma 5.1.7. Let t > be fixed and fi — /o be as defined in (IC 1-3) on paae \33\. i.e. 

h(x + L), x G (-L — I, —L + I) 



oth 



erwise 



Let 

fix / fi(x)+mt, -L-l-t<x<L + l + t . . 

(_ (J, otherwise 

where m > 0. TTiera t/iere eiisi positive constants Si and S 2 depending on m but independent 
of t such that if 

(1 + S 2 t)h(x + L + Sit), x G (-L - I - Sit, -L + l- Sit) 
= , 1 + S 2 t, x G [-L + I — Sit, L — I + Sit] 

j2[X) ^ (l + S 2 t)h(L + Sit-x), xe(L-l + 5it,L + l + 5it) ' ( ' 



0, otherwise 



then f 2 < f 



3- 
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Proof. Without any loss of generality, assume m < 1. Let M = 1 A sup,,, gM |/{(x)|, then 
M = 1 A 1/1 = 1/1 since I will be picked to be < 1 in (|5.62[) a bit later. Define 



9i{x) 



h(x+^), x e(-L-l-$&-L + l-ffi) 

A(0), *e[-L + ;-jf,L-*+j§] 

0, otherwise 



then any small piece of the curve of g\ is /i shifted by either 0, or — with < i. 
In particular, the two transition regions in g\ are the two transition regions in /i shifted 

rat n _ mt 
3M U1 31/ ' 



W or — W' an d * ne " m iddle" region (i.e. the region sandwiched between the two 



transition regions) in g\ is the middle region of j\ expanded left and right by Since 
M = sup x6K 1/(0)1, we have 



mt 

9i{x) - h(x) < — 

for all x £ (-L — I — L + I + j$f), therefore f% > g\ everywhere and in particular, s 
fz(x) — fi(x) — mt for x € (— L — I — t, L + I + t), we have 



h(x)-gi(x)>^ (5.61) 



.'-> 



forxe (-L-1- »*,L + l + $$) 
Next we define 



Then 



mt\ , , 

./•(•'•) {l + —)gi(x). 



mt mt 
J2(x) - gx{x) = —gi{x) < — 



since g\ (x) < 1 everywhere. The above inequality and (|5.6ip imply that 

fi(x) < f 3 (x) 

for x e (-L - I — f£f ,£ + I + pf)- Then <5i = and S 2 = y satisfy the requirement of 
/2 in (|5.60p . and the proof is complete. □ 

For the remainder of this section, we will establish Proposition 15.1.11 We assume 
that the initial condition of the PDE (|5.ip is (ao/o, bofo), where fo is as defined in (IC 1-3) on 
page [221 and (ao, bo) = (do, bo) lies in the region IZo defined in (|5.19p . By Remark |5 . 1 .41 and 
Lemma [5.1.51 we can pick (a s ,b s ) e ABC(ao,b ), with (a s ,b s ) — (a ,b ) if ao — b > 0.08 
(i.e. Lemma 15 . 1 . 5| / C ase 2), such that estimates (|5.20[) - (|5.24p regarding the evolution of the 
ODE (|5.2p are valid. We will use this, together with Lemma [5.1.61 at the beginning of this 
section, to show that there is a positive constant m such that for x G (— L — I — s, L + 1 + s) 
and sufficiently small s, 



^(a fo,b f )(x) - (a s f ,b s f )(x) > (a s ms,b s ms) . 
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Finally we will apply Lemma 15.1.71 to complete the proof of Proposition 15.1.11 

We divide this task into proving two lemmas, which correspond to the two cases in 
Lemmas 15.1.31 and 15.1.51 respectively. Before we proceed, we first pick 

(5.62) 
and 

e = 0.24 < 0.5ft (-^V (5-63) 

Lemma 5.1.8. (Case 1) Recall that 

K[ U K' 3 = {(u, v) G K : u - v G [0, 0.1] and u < 2v for v G [0, e)} 

and 




K[ H K = {(u, v) G K : u - v G [0, 0.09] and v G [0.55, 0.8]}. 

If{(aofo{x),b fo{x)) : x G [-L — I, -L + l}} C ^ U K' 3 and (a ,b ) G i?i nK , where TZ , 
1U 1 and 1Z' 3 are defined in i5.19\) . 15. 25]) . and t5.26\) respectively, and fo is defined in (IC 
1-3) on paae \33\ then the conclusion of Proposition [5.1.11 holds. 

Proof. By Lemma 15.1.31 and Remark 15.1.41 for sufficiently small s, we can pick K and K 
large enough such that 

K > 1 (5.64) 

and a point (a s ,b s ) G TZ with b s > 0.5 such that (o a /o(0), b s f(0)) G AB(a f (0), &o/o(0)) 
and 

^(a / (a;), b f (x)) > ((1 + Ks)a s f (x), (1 + Ks)b s f (x)) (5.65) 
for all x; furthermore, if b s fo(x) > e, 

(1 + Ks)bJ (x) - b s fo(x) > Ks, (5.66) 

and if b s f (x) G [0, e), 

(1 + Ks)bJ (x) - b s fo(x) > 0. (5.67) 

Here, (a s fo,b s fo) is the function to which we compare ^(ao/oj&o/o) to see how much 
"progress" we are making in increasing the w-coordinate. 

Forxe [-£-2M> L +25o] = [-L - ^, -L + l]u(-L + l, L-l)u[L ^ I, L + 
where the intervals [— L — — L + Z] and \L — I, L + ^q] are in the transition region, we 
have 

b s Mx) > 0.5/o | -L - — ) > 0.5/o ( —L - —] = 0.5ft ( — ) , 
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therefore by (|5.63|) . 

bj (x) > e. (5.68) 
Therefore by (|5.66|) we have, for x 6 [-L — 555, L + ^gg], 

(i + jf s )y W-yo(i)>ifs. 

For i£ [— L — I,— L— 250) U (Z + -Aq,L + 1] where b s f is possibly smaller than e, by (|5.67[) . 
we have 

(l + Ks)b s f (x)-b s f (x)>0. 
To summarize, combining (|5.65[) and the two inequalities above, we have 

w„(^(ao/o,6o/o))(a:) > (1 + Ks)b s f {x) (5.69) 

and 

{> Ks, x e [-L - 2^0, L + 2^0] 
>0, xe[-L-l,-L-^)U(L + ^,L + l] .(5.70) 
= 0, x$[-L-l,L + t\ 



(1 + Ks)b s 




Figure 5.8: The effect of the heat kernel on the function (1 + Ks)b a fo- The arrows indicates 
whether (l+Ks)b s fo(x) increases or decreases. The effects illustrated here are lower bounds. 
In [-L — 2^p, L + 2^j], the function decreases, which is why we need the first line (|5.70| to 
be > Ks to cancel out this decrease. 



As stated in ()5.5|) . |A/ | < 4. Therefore the heat operator e sA applied to b s fo may 
decrease its value by at most yf s. More precisely, 

e sA 6 s / - b s f > -js. (5.71) 

everywhere. We can use (|5.70p and (|5.71|) to obtain estimates on e sA J r *(a /o, b Q f ). We 
estimate the "progress" made after applying the heat kernel: by (|5.69[) and the monotonicity 
of the heat kernel e sA , 

7r v (e sA T:,(aofo, b fo) ~ {a s fo, b.f ))(x) > e sA (l + Ks)b s f (x) - bj (x). 
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Foixe [-£- 555, £+355], 

e sA (l+Ks)b s fo(x)-b s f Q (x) 

= (1 + As) {e sA bJ (x) - b,f (x)) + ((1 + Ks)b s f Q (x) - fe s /o(z)) 

, ~ . b s 3b s 
> -(l + Ks^s+^-s 

by (|5JT|1 . the first line of (|5J0|) . and the fact K > $ > ^f. Therefore 

7r4e sA ^(a /o,fco/o) - {a s f ,b s f ))(x) > (5.72) 

for sufficiently small s. On the other hand, for x S (—L—l — s, ^L — ^)(J(L + ^, L + l + s), 
by Lemma 15.1.61 we have, 

e sA (l + Ks)b s f Q (x) - (1 + Ks)b s f (x) > (1 + Ks)^s > ^s. 

Therefore by (|5.65|) and the above inequality, for x S (—L — l — s, ^L — ^q)L1(L+^, L+l+s), 
TT v (e sA T°(a f , b f ) - (a s f ,bj ))(x) > e sA (l + Ks)b s f a (x) - b s f (x) 

> (l + Ks)b s f (x) + ^s-b s f (x) 

> 5P S ' 

where the last line is due to the second and third lines of (|5.70|) . Hence for x 6 (-L — I — 
s, L + l + s), 

Trv(e sA F*{a fo,bofo))(x) > b s (fo(x) + -^s 

Then Lemma 15.1.71 implies that there exist positive constants 5\ and 8 2 independent of s 
such that 



^(e sA ^(a / , b Q f Q ))(x) > bj 2 (x), (5.73) 



where f 2 is defined in ()5.60p 



(1 + S 2 s)h(x + L + Sis), x £ (-L - I - 5 x s, -L + l- Sis) 

1 + S 2 s, x e [-L + I - Sis,L - I + Sis] 

(1 + 6 2 s)h(L + Sis — x), x € (L — I + 5±s, L + I + 5is) 

0, otherwise 



Similarly, the estimates in (|5.72|) to (I5.73|) also hold for the w-coordinate of 
e sA J-^(aofo,bofo) — (a s fo,b s fo), if b s on the right hand side of each inequality is replaced 
by a s . So for all x G (— L — I — s, L + I + s), we have 

e sA ^(a Q f ,b fo)(x) > (a s f 2 {x),b s f 2 (x)), 

as required. In particular, (a s ,b s ) in the statement of Proposition I5.1.T1 should be ((1 + 
S 2 s)a s , (1 + 6 2 s)b s ). □ 
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Lemma 5.1.9. (Case 2) Recall that 

Tl'o = {(u,v) G TZ 2 : v + 0.08 < u < il±|^£ _ .04, v > 0.55}, 

1 + 2cv 



(1 4- 2c)t; 

TZ 2 UTZ 3 = Uu,v)e1l:v + 0.02<u< y n ' -0.04 

1 + 2ci> 

/or u € [e, 0.8] and u <2v for v G [0, e)}. 

// {(ao/o(aO, bofo(x)) '■ % G [— £ — /, — L + /]} C K2 U K3 and (ao, 60) G i? 2 , w/iere 7?-2, #2 
and IZ3 are defined in \5. 38\) . i5.39\) . and |5.^0| ] respectively, and fo is defined in (IC 1-3) 
on page \33[ then the conclusion of Proposition \5.1.1\ holds. 

Before we prove this lemma, we observe that the union of all the regions where 
(ao, bo) may lie is (lZ' 1 r\lZo)UlZ' 2 , which is exactly IZo as defined in (|5. 19[) . If (ao, 60) G TZ[ D 
IZo, then the part of the line segment O(ao, bo) (O(ao, 60) consists of values of (ao/o, &0/0)) 
above y — e lies in TZ[. On the other hand, if (ao,&o) G TZ' 2 , then the part of the line 
segment O(ao,6o) above y = e lies in 1Z 2 . But in both these cases, for the part of the line 
segment O(ao, bo) below y = e, if suffices to consider IZ3 = {(u, v) G 1Z : u < 2v, v G [0, e)}, 
because the top tip of O(a ,b ) m the (u, u)-plane lies above the horizontal line v = 0.5, 

(l+2c)v 
l+2cv 



where u < 1 < 2v. The sufficiency of restricting to {(u,v) G 1Z : u < ^^cl" — 0.04} has 
been discussed below (|5.19[) on page[ 

Proof of Lemma 15.1.91 Under this case, the line segment formed by 

{( a ofo{ x ), bofo{ x )) -x£ [— L — /, — i + Z]} lies in 7?. 2 Ui?3, i.e. the portion of the line segment 

above the horizontal line v — e lies in 1Z 2 and right of the line u — v — 0.02, and the portion 

below v = e lies in IZ3. Furthermore, the top tip (ao, bo) lies in 1Z 2 , i.e. to the right of the 

line u — v = 0.08 and above the horizontal line v — 0.55. For x G [-L — j^, L + jqq], we 

have 

bofo(x) > 0.55/o (-L - > 0.5ft [-Jw) > t 

by (|5.63p . Therefore, by Lemma |5.1.5[ we can construct functions g 2 and g^: 

( 6 /o (x) +Ks, x G [-L- j^} 

92{x) = l b fo(x)(l-2s), i6(-I-/,-£- i L)u(L+ I i 5l L + I) , (5.74) 

[ 0, otherwise 

and 

b Q fo(x)+Ks, x G [-L — 2^q, L + ^q] 

//.-.(.-■) = { bofo(x)(l-2s), xe(-L-l,-L-^)U(L + ^,L + l) , 

0, otherwise 

such that both (j^g 2 ,g 2 ) and (^-53,33) are lower bounds of J 7 * (ao/o, oo/o)- Notice that 
93 < 92 everywhere, and 172 has discontinuities at —L — Z / 100 and L + Z/100, while (73 has 
discontinuities at — L — 1/200 and L + Z/200. See figure [5791 for graphs of g 2 and 53. 
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b + Ks 




(a) The function gi 

b + Ks 



-L-l 



-L 



_r L 

200 

(b) The function g% 



-L + l 



Ks 





-L-l -L -L + l 

(c) The function 174 



Figure 5.9: The functions 32, ff3, and 54; dotted lines denote the function bofo- 



Now we construct the function 174: 

9±(x) = ( 9 f\> Xe[ - L -^' L tMr 1 ^> (5-75) 
yyj \ g 3 (x), xe(-(»,- J L- T i ?J ]U[i + T i 5 ,ao) 

furthermore 34 is required to be C°°, monotone in \—L — , —L — j^] U [L + jq-q,L + i], 
and lying between (72 and (73, with 

|A ff4 (x)| < I (5.76) 

everywhere. For a; ^ [— L — 555)— i — j^q] U [L + j^gji + 555]) the last requirement is 
automatic by (|5.5[k but for a; e [-L-^j, — L — y^j]U[L + y^, L + ^\, it can be achieved for 
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sufficiently small s. Notice that since 34 < 52, (§^34,54) is a lower bound of ^(ao/o, &o/o)- 
Furthermore, for x S [— £ — 555, £ + ^q], 

9i(x) -b fo(x) =Ks. (5.77) 

Here (flo/o) ^0/0) is the function to which we compare ^(ao/oi^o/o) to see how much 
"progress" we are making increasing the ^-coordinate. 

We now turn to evolution according to the heat equation. First we deal with x (fc 

~ 2oTp^ + aoTj]- ^ or ^^ s ' we use 9 3 as tne i° wer bound for 7r„(J-jf (ao/o, &o/o))- We 
observe that 173 dominates (1 — 2s)&o/oj therefore monotonicity of the heat kernel implies 

e sA ff 3 > e sA ((l ~ 2s)6 /o) - (1 - 2,s)6 e sA / . 

By Lemma l5.1.61 e sA / (x) > fa{x) + ^ for all x G (—L—l-s, -£- ^)yj{L+ L+l+s), 
therefore 



e sA g 3 (x) > (1 - 2 S )6 (/oW + ^). 



5Z 2 

Since &o > 0.55, we have (1 — 2s)&o > 0.5 for sufficiently small s. Also recall that we pick 
I = in (|5.62[) such that p- = • Thus the inequality above can be written as 

e sA g 3 (x) > (1 - 2s)b f (x) + 0.5-^ = & o /o0r) + (3 - 2b f (x))s. 

0.5 

Finally, since 3 — 2&o/o > 3 — 2 = 1, we have 

e sA g 3 (x) > b fo(x) + s. (5.78) 

for all x S (-£ - / - s, -L - ^) U (£ + ^,L + I + s). 

For x € [— £ — 2^q,L + 2?jg], we use 34 as the lower bound for 7r^(jF^(ao/o, &o/o))- 
By (I5.76p . the heat operator e sA may decrease values of 34(1) by at most i.e. 

e sA g 4 (x)-g 4 (x)>-^s. (5.79) 

Therefore for x e [— £ — -J^,L + tJL], we have 



200' 1 200 J 

2 

where we apply (|5.79|) to e sA g 4 (x) — 34 (x) and (|5.77p to g 4 (x) — bofoix). Thus for x € 
200' L + 200]' we have 

e sA g 4 (x) - bofoix) > -U 



e sA 54 (x) - 6 /o(z) = (e sA <? 4 (x) - 54 (x)) + ( ff4 (x) - 60/0 (as)) > ~8 + Ks, 



since £T is chosen to be larger than 3/Z 2 in (|5.64p . 

The estimates in (|5.78[) and (|5.80p . together with the fact that 5(3 and 34 are lower 
bounds of ^(^(ao/o, bofo))(x), imply that there is a positive constant m, such that for 
x e (— £ - Z - s, L + I + s), 

ir v (e sA ^(aofo,b Q fo))(x) > 6 / (x) + s > 6 (/o(x) + a). 
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As in Lemma 15.1.81 we apply Lemma 15.1.71 to obtain the estimate 

7r v (e sA T°(a fo,b f Q ))(x) > b Q f 2 (x), (5.80) 

where f 2 is defined in (|5.60[) . 

For the M-coordinate of e ,F*(ao/o, &o/o)j we can obtain estimates (15.78)) to (|5.80|) 
if we replace bo on the right hand side of each inequality by oq. So we conclude that for all 
x e (-L - l — s,L + l + s), 

e sA ^{aofo-b fo)(x) > {a f 2 {x),b f 2 {x)), 

as required. In particular, (a s ,b s ) in the statement of Proposition I5.1.T1 should be ((1 + 
5 2 s)a ,(l + S 2 s)b Q ). □ 

Proof of Proposition 15. 1.11 The proposition follows from Lemmas 15 . 1.81 and 15.1.91 and 

the discussion below (|5.19p on page 1381 regarding the sufficiency of restricting the region for 
(a ,6 ) to TZq. □ 



5.2 Upper Bounds: Existence of and D2 in Condition 

(*) 

We establish the following proposition, which, together with Corollary 15. 1.21 verifies condi- 
tion (*) on page[nU As in Corollarv l5.1.21 the "L" in condition (*) is picked to be L — I. 

Proposition 5.2.1. If c is sufficiently large, then there exist constants d 2 < D 2 < 1 and 
T such that if vq(x) < D 2 for x G [— L + I, L — I] then v t {x) < d 2 for x £ [— 3L, 3L], where 
(u t ,Vt) solves the PDE h5.1\) . 

Proof. Because of the monotonicity of the PDE (|5.ip . it suffices to pick a uniform initial 
condition 



«o 

V 



some u, 
D 2 , 



and show that at time T, 



ut = 
vt < 



some u, 
do. 



Therefore we need only concern ourselves with the ODE (|5-2[) . We can bound 772 (u, v) 
defined in (|5.8p for any v > 1 — - as follows: 



772 (u, v) = [c(u — v) — 2)v < 



1 



1 

1 - - 

c 



2 v 



-v < 



1 



if c > 1. Thus for any two numbers D 2 and d 2 that satisfy l>Z?2>d2>l — there 
exists T, such that if v = D 2 , then vt < d 2 . □ 
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Part II 



Stationary Distributions of A 
Model of Sympatric Speciation 



GO 



Chapter 6 

A Model on Sympatric 

Speciation 



6.1 Introduction 

Understanding Speciation is one of the great problems in the field of evolution. According to 
Mayr [Mayr 1963], speciation means the splitting of a single species into several, that is, the 
multiplication of species. It is believed that many species originated through geographically 
isolated populations of the same ancestral species [Dieckmann and Doebeli 1999]. This 
phenomenon is relatively easy to understand. In contrast, sympatric speciation, in which 
new species arise without geographical isolation, is theoretically much more difficult. 

6.1.1 The Dieckmann-Doebeli Model 

Dieckmann and Doebeli [Dieckmann and Doebeli 1999] proposed a general model for sym- 
patric speciation, for both asexual and sexual populations. We will describe their model 
for the asexual population first. Each individual in the population is assumed to have a 
quantitative character (phenotype) x G K. determining how effectively this individual can 
make use of resources in the surrounding environment. A typical example is the beak size 
of a certain bird species, which determines the size of seeds that can be consumed by an 
individual bird. The function K : R — > M + (carrying capacity) is associated with the sur- 
rounding environment, where K x denotes the number of individuals of phenotype x that can 
be supported by the environment. For example, since birds with small beak size (say x\) 
are more adapted to eating small seeds than birds with large beak size (say X2, x 2 > Xi), 
K Xl will be larger than K X2 if the surrounding environment produces more small seeds than 
large seeds. In the Dieckmann-Doebeli model, K x is taken to be cexp(— (x %' ). Moreover, 

K 

every pair of individuals compete at an intensity determined by the phenotypical distance 
of these two individuals. More specifically, an individual of phenotype x\ competes with 

an individual of phenotype x 2 at intensity C Xl _ X2 , where C x = cxp(— #^-). Therefore each 

c 

individual in the population interacts with the environment via the carrying capacity K, 
and interacts with the population via the competition kernel C. 
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Let N x (t) denote the number of individuals with phenotypc x at time t. At any time, 
an individual of phenotype x gives birth at a constant rate, and dies at a rate proportional 
to * ^ i i- e - inversely proportional to the x-carrying capacity, but proportional to 
the intensity of competition exerted by the population on phenotype x, the numerator 
(C * N.(t)) x — ^2yC x - y N y (t) being how much competition (from every individual in the 
population) individuals with phenotype x suffer. In addition, every time an individual gives 
birth, there is a small probability that a mutation occurs and the phenotype of the offspring 
is different from that of the parent; in this case, the phenotypical distance between the 
offspring and the parent is then random and assumed to have a Gaussian distribution. 

Since the number of individuals of a certain phenotype increases via the birth mech- 
anism at a linear rate, but decreases via the death mechanism at a quadratic rate, extinction 
of all phenotypes will occur in finite time with probability one, i.e. N = eventually. For 
large initial populations, however, extinction will happen far enough into the future that 
interesting behaviour does arise before the population becomes extinct. 

Monte-Carlo simulations, shown in figure 16.11 give a fairly good idea of the be- 
haviour of the Dieckmann-Doebeli model for asexual populations. If the initial popula- 
tion is monomorphic (t = 1 in figure [6~Tj) . i.e. concentrated near a certain phenotype Xq 
( y^'n\o) ~ ^x )i then the entire population first moves (t = 30, 100, 200 in figure RTT]) to- 
ward x, the phenotype with maximum carrying capacity. If ac > &k (this includes the case 
ac = oo, i.e. equal competition between all phenotypes), then the population stabilizes 
near phenotype x. But if ac < &k, then the monomorphic population concentrated at 
phenotype x splits into two groups, one group concentrating on a phenotype < x, while the 
other concentrating on a phenotype > x (t = 330,370,400,500 in figure l6~Tj) . In the latter 
case, one can say that one species has evolved into two distinct species. 



t = 1 



t = 30 



t = 100 



t = 200 



t = 330 



t = 370 



/ = 400 



t = 500 



t = 300 
t = 600 




Figure 6.1: Simulation of the Dieckmann-Doebeli model with E = 
VI000, and ac = \/600. 



-50, 50] n Z, a K 



We now give a qualitative description of the Dieckmann-Doebeli model for sexual 
populations. Each individual in a sexual population is assigned three diploid genotypes with 
(say) five diallelic loci each. The first set of loci determines the ecological character x (i.e. 
phenotype in the asexual model). The second set of loci determines the marker trait, which 
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is ecologically neutral, i.e. individuals with different marker traits but the same ecological 
character have exactly the same birth and death rates. The third set of loci determines 
mating probabilities to; if to > 0, then such individuals prefer to mate with individuals of 
similar phenotypes; if m = 0, then such individuals have no preference; and if m < 0, then 
such individuals prefer to mate with individuals of a distant phenotype. In addition, \m\ 
determines the strength of this preference. 

The birth rates and the death rates are calculated the same way as in the asexual 
model; in particular, only information from the first set of loci is used to calculate these rates, 
as this is the only genotype that determines the phenotype of the individual. Dicckman and 
Doebeli considered two cases in their sexual model: 1. mating depends on the ecological 
character; and 2. mating depends on the ecologically neutral marker trait. For example, in 
the second case, individuals with m > prefers to mate with individuals of similar marker 
traits. 

Monte-Carlo simulations show that case 1 of the sexual model exhibits very similar 
behaviour to the asexual model, i.e. speciation if ac < ok and no speciation if ac > <?k- 
A caveat: if ac < &k, then only individuals, who prefer to mate with individuals of similar 
phenotypes, survive after the population splits into two groups. Hence in the end, there 
are two distinct groups of individuals who refuse to mate with individuals from the other 
group. For case 2 of the sexual model, Monte-Carlo simulations indicate that ac < a K is 
not enough for speciation to occur. In this case, ac < cax is needed, where c < 1 is a 
constant. 

As the sexual model exhibits similar behaviour to the asexual model, we will con- 
centrate on the analysis of the simpler asexual model. It is our hope that understanding 
the asexual model will give some insights in explaining the behaviour of the sexual model 
as well. 

6.1.2 A conditioned Dieckmann-Doebeli model 

Although the Dieckmann-Doebeli model for asexual populations is considerably less com- 
plicated than their model for sexual populations, it still seems too complicated for rigorous 
analysis. Thus we will attempt to simplify the model while preserving its key ingredients. 
Henceforth we refer to the Dieckmann-Doebeli model for asexual populations simply as the 
Dieckmann-Doebeli model. 

Before we describe our simplified Dieckmann-Doebeli model, we first introduce the 
concepts of fitness and selection. Selection occurs when individuals of different genotypes 
leave different numbers of offspring because their probabilities of surviving to reproductive 
age are different [Burger 2000] . If we define fitness to be a measure of how likely a particular 
individual produces offspring that will survive to reproductive age, then individuals with 
higher fitness should have higher probability of being selected for reproduction. Along these 
lines, it is natural to define fitness of a phenotype as the difference between the birth rate 
and the death rate of individuals of this phenotype. It is also natural to require the fitness 
function to be bounded between and 1. 

The key feature of the Dieckmann-Doebeli model is that each individual has a fit- 
ness that depends on both the carrying capacity associated with its phenotype and the 
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configuration of the entire population. More specifically, the fitness of a phenotype x is an 
increasing function of K x , the carrying capacity, but a decreasing function of (C * N) x , the 
competition it suffers. Here N x is the number of individuals of phenotype x. 

In the Dicckmann-Doebeli model, the number of individuals can fluctuate with time. 
As mentioned before, since the birth rate is linear but the death rate is quadratic, extinction 
will occur in finite time with probability one, which makes it somewhat meaningless to 
analyze the equilibrium behaviour of the system. We make the assumption that the number 
of individuals N is fixed over time, reflecting a constant carrying capacity of the overall 
population. The mechanism by which we achieve this is to require that death of an individual 
and birth of its single offspring occur at the same time, called replacement sampling in Moran 
particle models [Dawson 1993, Chapter 2.5]. This way, the number of individuals remains 
constant, and analyzing the behaviour of the population is then equivalent to analyzing the 
empirical distribution 



where x n , n = 1 . . . N, denotes the phenotype of the n individual in a population of size 



Before we describe our simplified Dicckmann-Doebeli model, we say a few words 
about our terminologies and notations: we refer to individuals in a population as "particles" , 
and sometimes refer to a phenotype as a "site" . We consider multiple models, both discrete- 
time and continuous-time; for discrete-time models, we use V to denote the fitness function; 
but for continuous-time models, we use m instead. Our simplified discrete-time Dieckmann- 
Doebeli model is as follows: 

1. E = [-L, L] n Z is the phenotype space, and tt, n N £ V{E) is a probability measure 



2. K : E — > [0, 1] is the carrying capacity, and C : Z — > R + is the competition kernel; 

3. V x (tt) is the fitness of phenotype x in a population with empirical distribution 7r 
(sometimes we notationally suppress the dependence on w); we define two possible 
fitness functions below; 

4. A is a Markov transition matrix associated with mutation, with A(y, x) denoting the 
probability of a particle of phenotype y mutating to a particle of phenotype x; 

5. At every time step t G Z + , the entire population is replaced by a new population of N 
particles, each particle chosen independently, according to the distribution p. (t, tt n ): 



In (|6.ip . the denominator J2 Z {t)V z (n N it)) is simply the normalization factor such that 
*}2 x Pxit, 7T ) = 1. In words, at every time step, the entire population dies and is replaced by 




N. 



on E; 




(6.1) 



64 



a new population, each individual x choosing an individual x' from the original population 
as its parent with a probability proportional to its fitness V x > , after which the new individual 
x undergoes mutation according to A. 

We consider two fitness functions: 



V^(n) = . (6.2) 

l_,z ^x-z^z 

Each of the two fitness function defined above is an increasing function of K x and a de- 
creasing function of (C * 7r) x . resembles more closely the original Dieckmann-Doebeli 
model, but it has the disadvantage of being in a more complicated form than and it is 
also not differentiable. 

By Theorem 1 in [Del Moral 1998], which we state below, {tt^, t G [0, T}} {ir t , t S 
[0, T]} as N — > oo , where denotes weak convergence and n t evolves according to the 
following deterministic dynamical system: 

y 2Zz^z{t)Vz{-K(t)) 

Theorem 6.1.1. Suppose E is compact and M is a Feller- Markov transition kernel on 
V(E), i.e. M : V(V{E)) -> V(V{E)). Let AfW = MC N , where C N is a Markov transition 
kernel on V(E) given by 

N 



C N F(ir) = F I i **i j < dx ^) ■ ■ ■ < dx n) 



i.e. a probability measure it replaced with an empirical measure ir N formed by N particles 
chosen independently according to n. Then 



(mw)' 



M r 



as N — > oo . 



This theorem is easy to understand: take n = 1, then it says that the mapping 
MCn converges to M, i.e. changing the input measure of M by an empirical measure of N 
particles makes almost no difference if N is large. 

Analyzing the dynamical system (|6.3p is not easy, partly because it is of a compli- 
cated form that is nonlinear in n, and we cannot find any Lyapunov function that associates 
with (|6.3p . A continuously differentiable function V : U — ► R is called a Lyapunov function 
if V is nondecreasing (or nonincreasing) along orbits. For a discrete-time dynamical sys- 
tem such as (JOJ), this means that V(ir(t + 1)) - V(ir(t)) > (or < 0) for all t > 0. For 
a continuous-time dynamical system, it means that dtV(w(t)) > (or < 0). Simulations 
of (|6.3p . however, seem to display some interesting behaviour, which we will describe after 
carrying out some non-rigorous analysis of 
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Without mutation, any site x with ir x = at any time r will stay for all t > r. 
Mutation enables individuals of phenotype x to be born in future generations even if there are 
no individuals of phenotype x in the present generation. But if we start with a polymorphic 
initial measure, i.e. 7r x (0) ^ for all x, then adding small mutation to the system should 
not cause significant changes in the behaviour of (|6.3p . Therefore we assume that A = I 
and 7r(0) is polymorphic. In this case, (|6 ,3|) can be simplified to 

7Ta;(< + 1) = 



E^(*)W*))' 

Thus if j4 = /, then 7r is a stationary distribution of (|6 . 3|) if and only if 



7Tx = -k x V x (tt) (6.4) 
c 

for some constant c. Condition (16. 4|) is equivalent to 

T4(-7r) = c for all x where 7r(x) ^ 0, (6-5) 

Let K and C be in the form considered by Dieckmann and Doebeli, i.e. K x — exp(— x 2 /2a 2 K ) 
and C x = exp(— x 2 /2a 2 : ). If V = V^ 2 \ then condition (16. 5p means that 

K x = c(C * fr)(x) for all x where n(x) ^ 0, 

which seems to indicate that if ac < <Jk, then % should be close to jV(0, a\ — a c ). On the 
other hand, if V = then 7r is a stationary distribution if 1 — ^-^^ j s a strictly 

positive constant. Notice that if K and C are both Gaussian-shaped with Kq = C'o = 1 
then 7r = jV(0, cr^- — Oq) makes 1 — — ( ^f~ z7rz constant; furthermore, this constant is strictly 
positive since (C * 7r)(0) < K § = 1 if ac < &k- 

Therefore for both and V^ 2 \ the dynamical system (I6.3P should have Gaussian- 
shaped stationary distributions if ac < ax- In simulations carried out by Dieckmann and 
Doebeli [Dieckmann and Doebeli 1999], however, ac < ax is the case that leads to specia- 
tion, i.e. the stationary distribution supposedly has two sharp well-separated peaks, which 
contradicts the analysis carried out in the previous paragraph. Simulations of (|6.3p with 
V = shown in Figure RT2I reveal that if 7r(0) ~ So, initially the population does split 

into two groups and begins to move apart, but as t — ► oo, the empirical measure converges 
to a Gaussian-shaped hump. This suggests the possibility that in the original Dieckmann- 
Doebeli model, conditioning on the population surviving long enough for convergence to 
equilibrium to occur (recall that in the original Dicckman-Doebeli model, extinction occurs 
in finite time), speciation is also a transitory phenomenon, rather than an equilibrium phe- 
nomenon. Simulations of (|6.3p with V — V^ 2 \ shown in Figure [B~3j does not even display 
transitory speciation behaviour. Instead, the initial spike at simply widens to a Gaussian 
hump centred at 0. Hence the particular form of the dependence on K and C * 7r seems to 
affect whether or not speciation occurs. 

From the simulations and non-rigorous analysis above, it seems that the dynamical 
system in (|6.3|) does not have a bimodal stationary distribution if both K and C are taken 
to be Gaussian-shaped. If K and C are taken to be rectangular (i.e. K x = l{|a;|<L} an d 
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t = t = 10 t = 20 t = 30 f = 80 t = 200 




t = 400 t = 800 t = 1200 t = 1600 < = 2000 * = 3000 




Figure 6.2: Simulation of (JOJ with E = [-149, 149] H Z, cr K = 60, ct c = 55, and V = 
t = Q ^ = 40 t = 80 t = 200 t = 400 t = 3000 




Figure 6.3: Simulation of (JO) with S = [-149, 149] n Z, a K = 60, <r c = 55, and V = ^ (2) . 



Ca; = l{|a;|<M} f° r some integers L and M), however, results from Appendix 1X1 shows that 
there exist bimodal stationary distributions. More specifically, Theorem IA.0.141 says that 
if v n is a convergent sequence of symmetric stationary distributions for the conditioned 
Dieckmann-Doebeli model with mutation parameter //" with fx" — > 0, then v"_ t « — > 0, 
where I — M — L + 1; in words, the mass in the middle gets very small as the mutation 
parameter approaches zero. 

6.1.3 A Moran Model with Competitive Selection 

As discussed earlier, the dynamical system (I6.3|) cannot be easily associated with a Lya- 
punov function, which makes analyzing its behaviour difficult. Keeping in mind that the 
essential ingredient of the original Dieckmann-Doebeli model is that the fitness function is 
an increasing function of K x and a decreasing function of (C * 7r) x , we define the fitness 
m x (Tr) to have the following form: 

m x {-K) = K x ^2 B x - z K z ir z , (6.6) 

z 

where the "cooperation" kernel B can be taken to be 1 — C. We assume B is symmetric. In 
the original Dieckmann-Doebeli model, pairs of individuals with small phenotypical distance 
compete at a higher intensity than pairs of individuals with large phenotypical distance; 
in our model, pairs of individuals with small phenotypical distance cooperate at a lower 
intensity than pairs of individuals with large phenotypical distance. To make our formulation 
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cleaner, we also adopt a continuous-time model. The advantage of adopting m x in (|6.6p as 
fitness and using a continuous time model is that the mean fitness of the population 

X x,z 

is a Lyapunov function [Burger 2000] for the dynamical system 

d t n x = ir x (m x - 1,). (6.7) 
This assertion can be verified by the following calculation: 
d t m„ = 2 m x d t Tr x 

X 

= 2 m x w x (m x - m,r) 

2; X 

= 2^TT x (m x - m^) 2 , (6.8) 

a; 

where in the second line we use the fact 

}^m n TT x (m x - m n ) = m 2 - m 2 ^ ^ = to 2 - to 2 = 0. 

x a: 

Since d{m v > for any 7r, m w is a Lyapunov function for the dynamical system (|6.7[) . 
and in particular, the mean fitness m n increases at a rate proportional to the variance 
of the fitness. We call (|6.7p the selection-only equation, as it does not have a part that 
corresponds to mutation. In Chapter l6.2.11 we will derive (|6.7[) as the deterministic limit of 
particle systems as the number of particles tends to infinity. 



6.2 The Particle Model 

We introduce two particle models, one with "strong selection" that yields a deterministic 
limit, and another with "weak selection" that yields a stochastic limit. We work on space 
E = [-L, L] n Z. Let 

L 

A = {(p-i, ■ • • ,Po, ■ ■ ■ ,Pl) ■ Pi > Vt and p t = 1} 

i=-L 

be the space of probability measures on E, i.e. A = V(E). Members of A are usually 
denoted by n, it, tt n , etc. We endow A with the following metric: 

dOk, 7r) = max |7r(a;) — 7r(cc) | . 

X 

Let K : E — > [0, 1] be the carrying capacity function, and B : Z — > [0, 1] be the cooperation 
kernel, with B z = meaning that sites separated by phenotypical distance z do not coop- 
erate at all (i.e. compete at full intensity), and B z = 1 meaning that they cooperate at full 
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intensity (i.e. do not compete at all). We assume B to be symmetric. The fitness of site x 
in a population with distribution 7r is defined as 

m x (n) = K x ) j B x - z K z ir z . 

z 

If one abuses notation by writing if as a diagonal matrix, B as a matrix, and 7r as a vector, 
then the vector formed by m. (n) can be written as KB Kir. The mean fitness of a population 
with distribution 7r is defined as 

rn-n = ^tt x K x B x ^ z K z tt z . 

x,z 

If one abuses notation again, then rn v can be written as a quadratic form ~k 1 KBK'k. 

Throughout this work, we will use symmetric or house-of-cards mutation, which 
means that the rate \i xy — [i y at which phenotype x mutates to phenotype y depends 
on y only. This is a common assumption in population genetics [Burger 2000], and it is 
precisely this assumption that allows one to explicitly write down a Lyapunov function for 
the selection- mutation equation (to be defined in (|6 . 1 Of) ) - As a further simplification, we 
assume that ji y = fi is constant in y, which makes the proofs a bit cleaner. Let X N (t) = 
(Xf r (t),...,X$(t)), t e R+, be an TV-particle system, with Xf(t) € E for all t and i. 
Define the empirical measure 

1 N 

i=l 

6.2.1 The Strong Selection Model 

For our model with strong selection, the particle system undergoes the following: 

• Selection: At rate Nrfi^N , a particle, say X^ , is chosen at random from the N particles 
and killed; at the same time, a new particle is born at x with probability mx l^ ^* . 

Since y~L T " x _ ^ = 1, m "_ — is a probability distribution; 

• Mutation: At rate N(2L + a particle, say X(* , is chosen at random from the N 

particles and killed; at the same time, a new particle is born at a site y with probability 
l 

2L + 1 - 

A particle at x gets replaced via selection by a particle at y at rate Nir x m y (ir N )iry , and 
gets replaced via mutation by a particle at y at a rate of Nfnr x . Let K — sup^g^^ n K x , 
so that m x (Tr) — K x J2 Z B x - z K z tt z < K'^2 z Ktt z = K 2 . Let l(dx) denote the Lebesgue 
measure on R + . The process described above can be constructed using a Poisson point 
process A N (dt, dx, dy, d£, de) on 

R+x{(x,y)eE 2 :x^y}x [0,1] x {1,2}, 

with intensity measure 

X N (A xBxCxD) = l(A)(#B)(#C)k(D), 
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where # denotes the counting measure, D C [0, 1] x {1, 2}, and k — I x (NK 2 Si + NfiSz). 
For all x,y € E 2 with x ^ y, jumps of A N (dt, x, y, [0, 1], {1}) give possible times at which 
a particle at y may be replaced by a particle at x by the selection mechanism, while jumps 
of A N (dt, x, y, [0, 1], {2}) give possible times at which a particle at y may be replaced by a 
particle at x by the mutation mechanism. The strong selection model can be expressed in 
terms of the following formula for ir x (t) : 



»~(<) = >r?(0) 



1 

'n 



i {< 



K 2 
K 2 



A N {ds,x,dy,d£_,l) 
A N (ds,dy,x,d£,l) 



l^<^( S -))A N (d Sl x,dy,dt2) 
l^<^(s-))A N (ds,dy,x,d^,2) 



(6.9) 



A solution to (|6.9[) exists because the total jump rate is finite for a fixed N. The two integrals 
inside the first set of brackets corresponds to selection, i.e. a particle at x gets replaced by 
a particle at y at rate Ntt™ m x (ir N )Tr x due to selection. In particular, A N (ds, x, dy, d£, 1) 
in the first integral accounts for the killing of a particle at y and a new particle being born 
at x, and A N (ds, dy, x, d£, 1) in the second integral accounts for the killing of a particle at 
x and a new particle being born at y. The two integrals inside the second set of brackets 
corresponds to mutation, i.e. a particle at x gets replaced by a particle at y at rate N 
due to mutation. 



N 



Proposition 6.2.1. As N — > oo, the processes ir N converge weakly to a deterministic 
process tt that takes values in V(E) and obeys the following system of ODE's: 



dt^x = n x (m(x, tt) - m w ) + fx(l - (2L + l)n x ). 



(6.10) 



Proof. First, we rewrite (|6.9|) by decomposing A^ into a martingale term A N and a deter- 
ministic drift term: 



y=-L 



-^(s-)m y (w N (s-))^(s-)]ds + f, /*[%(*-) 

.._ r JO 



ds 



y=-L- 



Jo 

+ f i[l-(2L+l)n x (s-)] ds 



(6.11) 
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where we define A = A — A to be the martingale part of A and 



M»(t) = - 



1 K< 



1 

"n 



/ <(^)m g (^( S -))^( S -) 
^ " K 2 

n^(s-)m y (n N (s-))n^(s-) \ 
K 2 ) 

l(ti<n»(s-))A N (ds,x,dy,d?,2) 
l^<^(s-))A N (ds,dy,x,d^2) . 



jA N (ds,x,dy,d?,l) 
A N (ds,dy,x,d£,l) 



We estimate the quadratic variation of the martingale term (t) and show that it con- 
verges to as N — > oo: 



1 

N 2 E 

y=-L 

1 L 

+ JV2 E 

y=-L 







t 1 ^ <(Q^(^( S -)) 7 rf( S -) 



NAT 2 ds 



NK ds 



l(Z<K(s-))Nflds + / l(£<7l£(«-))iV/idfl 



< 



< 



2X 2 

AT 



E 



2^ 

TV 



<' - - E 



ds 



2{K 2 + n)(2L+l)t 
N ! 



(6.12) 



which — > as A^ — > oo. Since the maximum jump size in M x is -j^, by Burkholder's 
inequality (see for example Theorem 21.1 in [Burkholdcr 1973]), we have 



t<T 



E [ ( supMf (t) ) ) < C ( £ (Mf > T + — 



1 







(6.13) 



as AT -> 0. The other two terms in ([5TTT]) . i.e. n£ (0) and J * 7rf (s-)[m :E (7r A '(s-)) - 
ra f »( s _)] + /i[l — (2£ + l)7r a (s— )] ds, are both C-tight, 7r^(0) being constant and the 
integrand in the integral bounded uniformly between constants. Therefore ir x is C-tight for 
each x. 

If there exists a sequence N n such that ir N ™ converges to ir weakly and it is contin- 
uous, then by bounded convergence, J Q w^ n (s— ){m x (n Nn (s— )) — m w N n ( a -)] + /i[l — (2L + 
l)7Ta:(s— )] converges to 



7r x (s)[m, x (7r(s)) - m,(,)] + /i[l - (2L + 1)tt x (s)] ds, 
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and since ir N has representation (|6.11[) , tt satisfies the following deterministic integral equa- 
tion: 

7Tx(t) = 7^(0) + f n x (s)[m x (w(s))-m n{s) ]+ f j,[l-(2L + l)7T x (s)]ds. (6.14) 
Jo 

A continuous (n(t), < t < oo) solves the integral equation (16.14j) if and only if it satisfies 
the ODE system (|6.10[) . By well-known results from ODE theory (e.g. Theorem 1.1.1 from 
[Wiggins 1988]), the solution to (|6. 10|) is unique because its right-hand-side is C°° in tt. 
Therefore solution to (|6.14p is unique as well, and the proof is complete. □ 



6.2.2 The Weak Selection Model 

For our model with weak selection, the particle system with N particles undergoes the 
following: 

• Selection: A particle at x gets replaced by a particle at y at rate Ntt^ m y (Tr N )iTy ; 



Mutation: A particle at x gets replaced by a particle at y at rate N jiu 



N. 

x > 



• Replacement sampling: A particle at x gets replaced by a particle at y at rate 

N± n N N 
2 "x "y ■ 

Just as in the strong selection model, this process can be constructed using a Poisson point 
process A N (dt, dx, dy, d£, de) on 

K+ x {(x, y)eE 2 :x^y}x [0, 1] x {1, 2, 3}, 

with intensity measure 

\ N (A xBxCxD) = l(A)(#B)(#C)k(D), 

where I denotes the Lebesgue measure on R + , # denotes the counting measure, D C [0, 1] x 
{1,2,3}, and k — I x (NK 2 5\ + NpL&2 + ^-S^). The weak selection model can then be 
expressed in terms of the following formula for ir^ (i) : 

= 7^(0) 
1 

+ N 



K 2 



1 «< 



n»( S -)m y (n N (s-))n»(s-) 
K 2 



A N (ds,x,dy,d£_,l) 
A N (ds,dy,x, d£, 1) 



1 

'N 



1 

'N 



l(C<<(s-))A iv (ds,x,dy,de,2) 



l^<^(s-))A N (d S ,dy,x,d^,2) 
1« < ^( S -)^( S -))A N (ds, x, dy, d£, 3) 
1(C < ^( S -)<( S -))A JV (d S ,dy,x,^,3) 



(6.15) 
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The two integrals inside the first set of brackets correspond to selection, those inside the 
second set of brackets correspond to mutation, and those inside the third set of brackets 
correspond to replacement sampling. By carrying out computations similar to those done in 
the proof of Proposition W.2. II (and the tightness proof in [Perkins 2002]), one can conclude 
the processes {ir N : N g N} is C-tight, and that each weak limit point ir satisfies the 
following martingale problem: 

n x (t) = n x (0) + / ■n x {s)[m x (n(s)) - m n(s) ] + fi[l - (2L + 1)7^(3)] ds + M x (t), 
Jo 

where M x is a continuous (^ r t 7r )-martingale such that M x (0) = 0, and 

(M Xl M y ) t = / 5 xy ir x (s) - Tr x (s)ir y (s) ds. (6.16) 
Jo 

Here 5 xy = l(x = y). Ito's formula shows that for F 6 C 2 (E), 

F{n{t)) - F(tt(0)) - / GF(tt(s)) ds 
Jo 

is a bounded continuous martingale, where 

L 

GF(ir) = I^KW-%)+ p(l - (2L + 1)tt. 



L dF_ 



Y L L 



d 2 F 



2 x t^ L y~^L " dir x dn y 

In particular, if F takes on the form F(ir) — f((ir, (/))) where (ir, <j>) =J2 z 7Tz 
is a function, then 



QF{-k) = J2 f(^^))M^(m x (n)-m 1T ) + fi{l-{2L+l)TT x )} 

x— — L 

L L 

+ 2 E E /"(M>)<^te(<W-%). (6.17) 

x——L y——L 

This is a special (finite E and symmetric mutation) case of generator for the Fleming- Viot 
process with selection. The martingale problem associated with Q has a unique solution 
(see Chapter 10.1.1 from [Dawson 1993]), so tt n converges weakly to ir on D(1R + , M\{E)). 

For the process described by the martingale problem (|6.16[) . Lemma 4.1 from [Ethier 
and Kurtz 1994] says that 



v(dn) = C ^ Yl n *J e " l7r dn -L ■••dvL 



is the unique stationary distribution. Here C is the normalizing constant such that v is a 
probability measure on V(E). Notice that if m x {it) = for all x and w, then = for 
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all 7r, and in that case, vid/rr) — C \T\x=-L n, ^J d/K-L ■ ■ ■ d/Kz is the unique stationary 
distribution of the Fleming- Viot process on E with symmetric mutation [Ethier and Kurtz 
1981]. Adding selection to the model has the effect of putting weight e m,r (where m w is the 
mean fitness of n) on the phenotypical distribution tt. But even the least fit tt has weight at 
least 1, since exp(min,r m w ) = e — 1, and the fittest it has weight at most e. The effect of 
fitness on the density at a particular phenotypical distribution n is thus only marginal. In 
contrast, as we shall see later, fitness will have a much more pronounced effect in the strong 
selection model. For the rest of this part of the thesis, we deal with the strong selection 
model outlined in Chapter 16.2.11 
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Chapter 7 

The Selection-Mutation 

Equation 



As discussed in Chapter 16.2.11 the particle model with strong selection converges weakly to 
the selection-mutation equation as the number of particles N tends to infinity: 

d t TT x = TiK -Tn-n) + _ (2£ + l)ir x ), (7-1) 

where, as before, m x — m x (n) is the fitness of site x in population it, and m v is the mean 
fitness of the population: 

m x = K x ^2B x _ z K z ir z (7.2) 

z 

■■ y^xrrix = ) J ir x K x B x - z K z 'Kz- (7.3) 



First, we state a few assumptions on the parameters involved: 

1. x e E = [-L, I]nZ = {-L, . . . , — 1, 0, 1, ... , L}, 

2. K:E^ (0,1], 

3. B : 1^ [0,1]. 

We establish a few basic facts about the system (|7.1I) . First of all, m, (uniformly in x) and 
to w both lie in [0, 1], therefore m x — > — 1. Thus 

= ^ (rrij, - m„) + - (2L + 1)tt x ) 
> -tt x + - (2i + 1)tt x ) 

= |« - (2i/i + n + 1)tt x . (7.4) 

If tt x < 2 Lfj.+fj,+i ' then <9t7r x > regardless of 7r. This means that there can be no stationary 
points of the system (|7.ip with any ir x < jLJI+JI+i^ anc ^ furthermore, for those x where 
ir x < 2 (2Lfl+fj,+i) initially, n x will increase at a positive (bounded away from 0) speed and 
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eventually ir x > 2 (2L^+^+i) ^ or au x ' Secondly, since the sum over all x of the right hand 
side of (|7TTj) is 

^ ^(m^ - m w ) + ^(1 - (2i + 1)^) 

= 53 n x m x -m w }^7T x + (2L + - fi(2L + 1) 5""Vz 
= m v - m OT + (2L + (2L + 

= 0, (7.5) 



we have 



<9 t 53 7T X = 0, 



hence the total mass Y2 X n x remains constant. This, together with the first observation we 
just made, imply that if at t = 0, 7r(0) is a probability measure, i.e. Y] tt x (0) — 1, then 
ir(t) remains a probability measure for all t. 

Since ir x will become instantly nonzero at x where tt x = initially, we restrict our 
attention to polymorphic initial conditions, i.e. 7^(0) > for all x, which is equivalent 

o 

to saying that 7r(0) lies in the interior A of the set A. For polymorphic initial conditions, 
n x (t) > for all x and t, and therefore (|7.ip can be written as: 

d t ^x = ftx ( m x - m OT + /x(2L + 1)J . (7.6) 



Furthermore, there is a Lyapunov function V v : A — > K for the dynamical system (|7.6|) : 

V„ = -rrin + /i^logTTa,. (7.7) 

X 

Notice that 9 7rx m 7r = 2^2 iz K x B x - z K z 'k z = 2m x . The assertion that V n is a Lyapunov 
function for (|7.6[) can then be verified by the following simple calculus exercise: 

X 

= 53 (j rix + nx i^ 71 '* + ~ ™ n ~ + 1 

= y2(m x -\ )ir x (m x -\ f%- /u(2Z + 1 

— (m^ + /i(2L + 1)) 53 7T2, I TOj, H — — n(2L + 1) 

Notice that J2 X tt x (m x + ^- - - ^(2L + 1)) = by ([73]) . therefore 

3s V* = J3 ^ ( m * + ~ ~ m * ~ ^ 2L + 1 )) - °' 
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hence V w is a Lyapunov function for (|7.6[) as claimed. 

In fact, according to Theorem A. 9 of [Burger 2000], (|7.6p is a so-called Svirezhev- 
Shahshahani gradient system with potential V n as defined in (|7.7p . i.e. dtir = VV(ir), 
where VV(ir) = G 7r VV A (7r) and is the matrix formed by entries g xy — n x (S xy — ir y ). 
Any gradient system, such as (|7.6[) . has the property that all orbits, regardless of initial 
condition, converge to some point in the w-limit set 

Duj = {p '■ V is an accumulation point of ir x (t) as t — > oo}. 

All points in are stationary points of (|7.6j) . Since it is a stationary point of (|7.6p if and 
only if 

rh x -rn jT + fj, - (2L + 1)^=0 for all x e E, (7.8) 

where we write rh x = m x (ir), all points in D u satisfies (|T. 8|) . We observe that if tt is a 
stationary distribution and m x > rh y , then condition (17. 8|) means that 4- - (2L + 1) < 
4 — (2L +1), therefore ^a, > 7r u , i.e. 

~y y 

m x > rhy 7r s > 7r„. (7.9) 

In words, fitter sites have more mass. 

We will try to characterize the stationary points of the dynamical system (|7.6|) for 
K and B satisfying the following conditions: 

K is symmetric and unimodal with Kq = 1, and B is of the form 

B x = b + (1- &)1{| !C |>m} with b e [0, 1] and L < M < 2L. (7.10) 

Define I = M — L, then for x G [— I + 1, 1 — 1], the cooperation intensity between x and any 
other site z G [— L, L], B x _ z , is equal to 6, which means that 

to* = K x ^ B x - z K z ir z = bK x ^ K z tt z . (7-11) 

7.1 Mild Competition: 6 close to 1 

If b = 1, then B x = b for all x, hence there is equal competition between all sites. This 
actually means that competition plays no part in how fit site x is and m x is proportional to 
K x . Therefore, since K x is unimodal (hence K x is strictly increasing in [— L,0] and strictly 
decreasing in [0, L]), the fitness should be unimodal, too. Recall from (|7.9[) that stationary 
distributions of (|7.6I) has the property of fitter sites having more mass, thus we expect the 
stationary distribution it to be unimodal as well. In particular, it should attain its maximum 
at x = 0. As [i — > 0, we expect the "peak" of tt concentrated around to become sharper 
and sharper, approaching Sq, the 5-measure concentrated at 0. In fact, as we shall see, b 
only needs to be somewhat close to 1 for this behaviour to occur. 

We now show that for any stationary distribution it of (|7.6|) , site is fitter than any 
other site for b G (|, 1] sufficiently close to 1, i.e. too > rh x if x ^ 0. This will mean that as 
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H — * 0, any it approaches the (5-measure. Recall from (|7.1ip that for x E [—1 + 1, 1 — 1], we 
have 

m x = bK x K z w x . 

z 

We recall that Kq = 1 and K x is assumed to be strictly increasing in [— L, 0] and strictly 
decreasing in [0, L], therefore K x attains its maximum at x = 0, and thus for x E [—1 + 
1, -1] U [1,1- 1], K x - Kq <K 1 -K <Q. Therefore for x E [—1 + 1, -1] U [1,1- 1], 



m x - to 



= b{K x - K ) K z ir z < -b{K - K x ) Y K z tt z 



We now apply the bound J2 Z ^z^z > inf^sA J2 Z ^z^z = Kl to the above inequality to 
obtain 

m x - to < -b(K - Ki)K L . 
Since b is assumed to be > |, we have for x E [—1 + 1, —1] U [1, 1 — 1], 

m x -m < ~{K Q -K X )K L . (7.12) 
We also bound the fitness for sites x in [l,L]: 

m x = bK x ^K z TT z + (1 - b)K x Y K z^z 

z \x-z\>M 

= bK x 22 K z tt z + (1 - b)K x 2_j K zK z since x > I 

z x — z>M 

x-M 

< bK x Y K z n z + (1 - b)K x Y K z since tt z < 1 

z Z =—L 
-I 

< bK x Y K z n z + (1 - b)K x Y K z 

z Z =—L 

< bK x YK z n z + (l-b)(L-l + l)K x K h (7.13) 

z 

where in the last line, we use the following: for z E [— L,x — M] c [-L, —I], K z < K-i = Ki. 
Similarly, for x € [-L, —I], we have the same bound (|7.13p . Therefore for x E [-L, —l}U[l, L], 
we have 

m x < bK^K^ + il-b^L-l + l^Ki 

Z 

< bK x YK z ^z + (l-b)(L-l + l)Kf, (7.14) 

z 

where again we use the bound K x < Ki for x E [— L, — I] U [l,L]. We use 1 1[) and (|7. 14|) 
to estimate m x — Too for x E [-L, —I] U [I, L]: 



m x - to 



< bK x Y Kz7t z + {l-b){L-l + 1)K? - bK Y K z*z 



b(K x - K Q ) Y K z*z + (1 - b)(L - I + 1)K, 
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Since K x — Kq < for x £ [-L, —I] U [I, L], we have 

m x ~m < -b inf (K - K x ) inf V K z n z + (1 - 6)(L - 2 + l)i\Tf 

x£[-L,-i]U[i,I,] tt£A*-^> 

= -6(^o-^)^L + (l-^)(i-^ + l)^ 2 - 
Since 6 is assumed to be > ^, the above bound can be simplified to: 

m»-mo<(l- b)(L - I + 1)K? - ^(K - K t )K L . (7.15) 



Thus if b is so close to 1 that 



(Kg - X,)Jf £ 

4(L- 1 + 1)1^ 



i-6< '^ 2 , (7.16) 



then let 5i = j(-fQ) — be a positive constant and we have 

(1 - 6)(L - I + l)K? < ^(Ko - Ki)K L = ^(K - K,)K L - fc. (7.17) 
Thus if condition (17716]) holds, then (f7~15|) and (f77T7|t imply that for x € [-£, -Z] U [Z,L], 

TO3 — wo < — (5l . (7-18) 

Define 5 — min (5\, \ {Kq — K\)Kl), then the estimates in (|7.12[) and (|7. 18[) mean that for 
all x £ E\{0} and any 7r, 

™£c — Too < —S- (7-19) 

This shows that for b satisfying condition (|7.16|) and for any 7T, site is fitter than any other 
site. We will use this bound to establish the following. 

Theorem 7.1.1. If K is symmetric and unimodal with Kq = 1, and B x = &+(l — &)1{| k |>m} 



with L < M < 2L, l = M - L, and b £ 



, i/ien as /! — * 0, 



sup{||-iT M — <5o||oo : is an stationary distribution for mutation parameter /i} — ► 0. 

Proof. Recall condition ||7.8|) : ir^ is a stationary distribution of (|7.6p if and only if for all 
x £ [—L,L], 

rh£-rfh» + fi (-^ - (2£ + l)J = 0, (7.20) 



where we write = m^ir^). We make the following observations using condition (|7.20l) : 
if rh'i > m^v, then — Jp- + (2L + 1) > 0, which implies that 7r£ > 1/(2L + 1); similarly, if 
rrij. < then 7r£ < l/(2£ + 1). 

Since 1/tt x 1 > 1, the following bound holds for all x: 

m£ - m # M + m(1 - (2L + 1)) < 0, 
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which implies that 



to£ - < 2L(i. (7.21) 

We consider [i small enough such that 2L/i < | , where S is defined right after (|7. 18|) . Then 
the estimate (|7.2ip applied to x = means that 

TOq — < 2L/i < — , 

which implies that 

Applying the above to estimate (|7.19p , we get, for all x ^ 0, 

rh^.<rh^-8 < to # m - -, (7.22) 

In particular, the only x where mff. > is x = 0. 

Using the bound (|7.22|) , condition (|7.20p implies that for x ^ 0, 

Ki" (2i + 1) )-5- 

Therefore 

^>^- + (2L + l), 

which — > oo as /i — > 0. Hence 7r£ — > as /i — > for all x ^ 0. Notice that the proof does 
not depend on which 7r M we pick, therefore we are done. □ 

Remark 7.1.2. The crucial estimate for the above proof is \7.19fy . In the case of equal 
competition, i.e. 6=1, it is very easy to derive \7.19[) : 

m x - to = b(K x - Kq) V* K z n z < -b inf {Kq - K x ) inf V" K z t: z 

z z 

= —b(Ko — Ki)Kl, 

which is a positive constant independent of \x. 



7.2 Intense Competition: b close to 

Results from Chapter 17.11 show that if the competition between pairs of sites that are far 
away from each other is not intense, i.e. b is close to 1, then as /i — > 0, the stationary 
distribution(s) converge to 5q, and therefore, there is no speciation. In this section, we show 
that if there is the most intense competition between pairs of sites that are far away from 
each other, i.e. 6 = 0, then we do see speciation in the stationary distribution(s). More 



80 



interesting behaviour arises in the case of positive but small b. We will show that, in this 
case, if /x is small enough, then there are at least two vastly different stationary distributions, 
one resembling the <5-measure, the other bimodal and having almost zero mass in the middle; 
on the other hand, if fi is sufficiently large, then all stationary distributions are bimodal 
and have little mass in the middle. Thus for small //, whether speciation occurs eventually 
in the dynamical system depends on the initial state of the system. But for large enough /i, 
speciation will occur eventually if one waits long enough. We first illustrate this behaviour in 
a system with 3 phenotypes {—1, 0, 1}, whose stationary points we can calculate explicitly. 



7.2.1 Study of A One-dimensional System 

Here we will take the simplest possible scenario, and show that the dynamical system (|7.6p 
has exactly two stable stationary points. Let L = 1, E = { — 1,0, 1}, Kq = 1, and K\ = 
K-\ = i. Let B x = b + (1 — &)l{|a;|>2}j i- e - phenotype —1 cooperates with phenotype 
and itself at level b (i.e. some competition), and cooperates with phenotype 1 at level 1 
(i.e. no competition). We only consider symmetric distributions, i.e. 7r_i = tti. Taking into 
account that 7r_i + itq + 7Ti = 1, the dynamical system (|7.6[) has only one variable, say ttq. 
The fitness of the 3 sites in E and the mean fitness are: 

rn = K b(K ir + 2Knn) = b(ir + m) 
toi = K^bKoTTa + (1 + b)Knn) = ^{bno + 
— , , n n l+b . 

TO X = 67r (7To + TTlJ + 7ri(b7r H — TTl) 

Thus the dynamical system (I7.6[) with variable ttq can be written as a single ordinary dif- 
ferential equation: 

<9 t 7r = 7To(to - m w ) + M(l - 37r o) 

= 7T (b(n +7Ti) - &7r (7r +7Ti) - 7Ti(&7r - ~^~ 7T ^ + MO- _ 37r o) 

Substituting in m = h(l — no), we get 

dtno = — ^ + Trg + ^tt + fi. (7.23) 



Now we take b = i, then (|7.23p can be simplified: 



<9 t 7r = -t^ttq + T^o ~ ( 7^: + 3 A* ) tto + A 4 



3 3 1 2 / 1 
7TX + -7Tn - — 

20 5 V 20 

3 ( 4 2 /l \ 20 
= -20^o-3-o 2 +(3+20mJ 7 to- T M 

We define the polynomial p(x) = a; 3 — |x 2 + (| + 20/j).t — ^jti, then p has roots 
cci = |, X2 = |(1 + \A — 80/1), and X3 = |(1 — VI — 80/1)- Notice that the root Xi does 
not depend on /i, although this is only true for b = |. For other 6's, all roots depend on /x. 
Two plots of — ^kp(x) are shown in figure 17.11 
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For fj, < there are three real roots, as in figure [7TT aV In this case, if 7To(0) < |, 
then 7To(t) — * £3 ~ 0.16 as t — > oo; since 7r__i = 7Ti and 7r_i + ttq + 7Ti = 1, 7r_i = 7Ti — ► 0.42 
as t — > oo. This distribution has much larger mass on sites —1 and 1 than on site 0, thus 
we can say that speciation occurs eventually. But if 7ro(0) > |, then 7To(t) — * £2 ~ 0.84 
as i — > 00 , then 7r_i = 7Ti — > 0.08. This distribution has much larger mass on site than 
on sites —1 and 1, and we say that speciation never occurs. On the other hand, if /i > 
there is only one real root, as in figure (|7.1f b~). In this case, regardless of initial condition, 
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Two plots for b = jq are shown below. For sufficiently large /i, e.g. \x = in 
figure I7.2f b) , regardless of initial condition, n converges to a configuration with little mass 
in the middle (approximately 0.037), i.e. speciation occurs. But for fj, sufficiently small, 
e.g. [i — T^jQ in figure [7T27 ah there may or may not be speciation depending on the initial 
condition. The time evolution of ttq with different initial conditions for both /z = and 
H = t^tt are also shown. 




50 100 150 , 200 250 300 350 50 100 150 , 200 250 300 350 



( c ) A* = 555 (d) M — 

Figure 7.2: One dimensional case: b = jk 
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7.2.2 Large enough [i 

Analysis of the dynamical system (|7.6|) in its simplest form in the last subsection shows 
that if \x is small, there may be two vastly different types of stationary points for (|7.6[) . In 
subsections l7.2.3l and l7.2.iI we establish this for (|7.6p in its general form. But in this section, 



we examine the behaviour of (|7.6p when /i > 4J ^ rs^I) an( ^ establish that all stationary points 
of (|7.6p are bimodal. 

We maintain the assumption in (|7.10[) that K is symmetric and unimodal with 
K = 1, and B is of the form B x = b + (1 - b)l^ x \> M} with b G [0, 1] and L < M < 2L. 
We will need a uniform (in lower bound on . We first establish a crude lower bound 
on m^n that does depend on \x. Recall from (|7.8p that fr' 1 is a stationary distribution if and 
only if for all x G [— L, L], 

ml - + (i (J^ - (2£ + 1)^ = 0. (7.24) 

Therefore 

jtx ( — — (2L + 1) ] = mjf — < < sup m x (n). 
Since K and £? lie in [0, 1], m x = K x ^ z B x _ z K z tt z < 7r 2 = 1, and therefore 

/i^-(2L+l)j < 1, 

which means that 

K > " r > M- (7-25) 

M 

For x G [I, L], we have (recall the first three steps of (|7.13p ). 

x—M 



= bK x J2 K z tt z + (1 - b)K x 



=-L 
c-M 



> bK^ K z ii z + (1 — b)KL ^ z 7Tz since K x is decreasing in [0, L] 

z Z =—L 

> bK 2 L + (l-b)Kl 

= K 2 L . (7.26) 

Since K and i? are both symmetric, the same bound applies if x G [— L, —I]. Therefore (|7.25p 
and (I7.26P imply the following estimate of the mean fitness m^: 

l -I 
= J2 m ^^x >J2 mx ^^ + m x {^)^ 

X x—l X — — L 

L 

> 2j2K 2 L fi = 2Klii(L-l). (7.27) 
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This crude lower bound on ra%n depends on fj,, but in Lemma 17.2.11 we will improve it such 
that it does not depend on /x for sufficiently small b. For now, we use it to establish an 
estimate on 7r A '([— / + 1,1— 1]) that will be needed for the proof of Lemma 17.2.11 below. 
Condition (|7.24j) implies that for x € [-1 + 1,1-1], 

H f-L - (2L + l)j = fn^ - m£. (7.28) 

For x G [— I + 1,1 — 1], J2 Z B x -zK z tt z — bJ2 z K z ~k z is constant, thus the maximum fitness 
is attained at x — 0, with 



m 



(tt) = bK J2 K z n z < b, (7.29) 



since K x is increasing in [— L, 0] and decreasing in [0, L], If b < 2K\[i(L — I), then we can 
apply the estimate (|7.27[) on m^, and bound the right hand side of (|7.28[) : 

to#m — ihff. > m#/j — rriQ > 2K\\i{L — I) ~ b, 

which is > if b < 2Kln(L-l). Thus for b < 2K\ii(L-V), (f7^5|) implies that ^-(2L+1) > 
0, i.e. 

th; < 1 



2L + 1 

Hence we can bound the mass in [— £ + 1, 1 — 1]: 

^([-i + l,f-l])<^±l. (7 .3 0) 

Before we state the theorem of this section, we establish the following lemma, which improves 
upon the bound (|7.27[) such that it does not depend on //: 

Lemma 7.2.1. For fi and b in the region R\ = {(/u,, b) : < 6 < vahi{A^K\{L — 
I), 4 (2L+i) 3 )}' there is a positive constant c\ that depends on L, I, and K but not on /i, 
such that rn%v > c\ for any stationary distribution tt^ . 

Proof. We notationally suppress the dependence of tt**, to^, and m^n on fi. Suppose that 
7r x TT y < S for all x e [-L, -I] and y G [x + M, L], (7-31) 

where 

is a positive constant if (/i, 6) € The pairs (x, y), with x G [— L, — I] and y G [a; + Af, L], 
are exactly those that contribute weight 1 to the calculation of the mean fitness as defined 
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in (|7.3p , while the pairs (x' , y') with x' 6 [-L, —I] and y' 6 [— L, a; + M — 1] contribute only 
weight b. Condition (|7.31|) implies that 

ma = ^ K x K z tt x tt z + b ^ K x K z tt x tt z 

\x-z\>M \x-z\<M 

< n E s + b E K o 

\x-z\>M \x-z\<M 

< 2{L-l + l) 2 5 + 2b{2L+l) 2 

< 2{2L+ l) 2 {S + b). (7.33) 

Recall from (TT30|) that tt([-Z + 1, I- 1]) < |£±±-, which is equivalent to n([—L, —I] U [Z, L]) > 

. Therefore either 7r([-L,-Z]) > or ttQZ.L]) > jj^. Suppose w([l,L]) > 
then we can bound the fitness of site — L: 



L L 

z — — L z—l 
L 



m_ L = K_ L J2 B_ l _ z K z tt z > K_ L £ K z 



> Kl^z=Klm,L])>Kl^—L (7.34) 



•2L + 1 



Since 5 + b = ^L+iyi ■ f 7 "^ 3 ! and (fT34)l together imply that 

m_ L - m # > K\±f±- - 2(2L + 1) 2 (5 + 6) = 0. 
As a result of the inequality above, condition (17.24p implies that 

' (2L+1)<0, 



7T-L 

hence 

*-L > ^Vl- (7-35) 
Combining the two bounds (|7.34|) and (|7.35[) on 7t_l and m_j,, we have 

= E ^ 7r_LTO-L > K l + l) 2 ' (7.36) 



By similar reasoning, if 7r([— L, — Z]) > jfjjj, then we have mx, > 2 l+i ano - ^ ^ 2L+1 ' 
which also implies that rn% > K\ ( 2 l+i) 2 • Therefore condition (|7.31[) implies that m# > 
if£ ^l+i)^ • This would actually contradict (|T.33[) for sufficiently small 5 and 6, but it does 
not matter since in that case, it just says that condition (|7.3ip is impossible and the analysis 
below applies. 

On the other hand, if condition (|7.31[) is not satisfied, i.e. there is at least one pair 
of phenotypes, say (x, y), with x G [-L, —I] and y S [x + M, L], such that TT x Tfy > S. Then 
it is easy to see that 

m*> K x K z n x n z > K 2 n x 7T y > K 2 6 > 5^ "2 , (7.37) 

\x~z\>M ^ ' 
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the last inequality due to (|7.32p and the requirement b < ^Ah+Tji ■ Therefore combin- 
ing (|7.36[) and (|7.37p , we conclude that 

m*>^( K ^ L - l l K ^ L -»)>o. 
\(2L + iy '4(2^ + 1)3; 

This bound is uniform for any positive (/i, b) satisfying the condition b < 2K^fi(L — I) and 

K 2 (L — l) 

b < 4 (2L+i)3 ana - the conclusion of the lemma follows. □ 

Theorem 7.2.2. If K is symmetric and unimodal with Kq = 1, and B x = b+(l — &)1{| X |>M} 
with L < M < 2L, then with the constant ci = c\(K, L, I) defined in Lemma \7.2. 1\ we have 

— If f 0r X [ — ' + 1)^—1] ffln ^ (Ml ^) ^/* n <? * n 

* = {(/*, b) : < 6 < min (V*f£(£ - Z), |, ^ff + ~^ ) } , 

Proof. Recall from (|7T29| that for x G [— Z + 1, Z — 1], the maximum fitness is attained at 
x = 0, and mo(7r) < 6. If b < s j 1 with ci from Lemma T7.2.H then mo(7r) < Hence for /i 
and b lying in i? C R\, where R\ is defined in Lemma l7.2.11 Lemma 17.2.11 implies that 

rn%^ — mjj > -ji. (7.38) 
Condition (|?T2"4]) and ([735)) together imply that for x G [-1 + 1,1-1] and (/i, b) G R, 



4r-(2i + l)) = m*M-m£>^-. 
tr x / 2 



Therefore for x G [—1 + 1,1-1] and (ju, 6) G R, 

< 1 < ^ 

--^ + 2 L + 1- Cl ' 

and the proof is complete. □ 

Remark 7.2.3. 7/6 = 0, i/ien Theorem \ 7. 2. 2\ works for any [i, no matter how small. 
Remark 7.2.4. The constant c\ in Lemma \7.2.1\ is small. 

7.2.3 Small ji: Existence of 5-like Stationary Measure 

Results from Chapter 17.2.11 for one-dimensional systems indicate that when b and \x are 
sufficiently small, there should be at least two stationary distributions, one resembling the 
5-measure, the other bimodal and having little mass in the middle. In this section, we show 
the existence of o"-like stationary distributions. More specifically, we establish the following: 

Proposition 7.2.5. Define k = min x \K X — K x -\\. If fi < then the set Ai is an 

invariant set for the dynamical system J7.6p , where we define 

A\ = {-k G A : ir x < t\ for all x ^ 0}, 
with ei < min(^, 2K -i™_ l+1) , wtwi)- 
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Proof. Let fi < ^ei be fixed. For n G A\, since tt x < t\ < j^, we have 7r > 1 — 2Lei > |. 
Recall that I = M — L and that for a; G [— Z + 1, Z — 1], 

to x = bK x 22, K ^' K z- 

z 

Since is increasing in [— L, 0] and decreasing in [0, L] with i\To = 1, sites in [0, 1 — 1] 
have decreasing fitness, and sites in [— I + 1, 0] have increasing fitness, and we also have the 
following estimates: 

rn — mi = b(Ko — K\) K z tt z > bkK Tt > — since tt > |, (7.39) 

z 

mi < bKiK < bKi. (7.40) 
For 7r G Ai and a; G [I, L], 



mi — m x 



= K t bY,K z ^z-K x 



x-M 



bJ2K z TT z + (1-6) £ 



x-M 



- (Jf, - K x )b K z n z -K x (l-b) 

z z= — L 

-2 

> kbK ir - K t K z ir z 

z=-L 

kh 

> ^-Kf(L-l + l)ei by (EM]) 

> 0, (7.41) 

since we assume ti < 2 k' 2 {l'-i+i) ' ^ ^he same calculation as in (|7.4ip . to_; — m x > for 
x G [— i, — Z] as well. Therefore sites in [-L, —I] are less fit than site — Z, and sites in [I, L] 
are less fit than site Z; furthermore, sites in [—1, 0] have increasing fitness, while sites in [0, 1] 
have decreasing fitness. In particular, among sites in [-L, —1] U [1,£], sites 1 and —1 have 
maximum fitness. Any measure it G Ai looks like the <5-measure. For such measures, the 
mean fitness to w is close to but less than too. We estimate the difference between m^ and 
toi: 



m-jr — mi 



/^m x Tt x — mi > too7To — mi > toq(1 — 2Lei) — mi, 



since 7To > 1 — 2Lei from the beginning of the proof. Now using estimates (|7.39p and (|7.40p . 
we continue estimating m^ — mi. 

bk 

to (1 - 2Lei) - mi = (1 - 2Le 1 )(m - mi) - 2Leim 1 > (1 - 2Le i)y - 2Le x bKi. 

Recalling from the beginning of the proof that 1 — 2Lei > h, we use the assumption €\ < 
iqlKi ^° estimate the right hand side of the above inequality: 

r „ T .bk „ T , 6fc „ r , T ^ fe 6fc 

1 - 2Lei 2Le 1 bKi > 2LbK x > — . 

V 11 2 1 1 - 4 1 WLK 1 ~ 8 
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Therefore 



bk 

mi > — . (7.42) 



Since among sites in [— L,— 1] U sites 1 and —1 have maximum fitness, (|7.42|) implies 

that for all x ^ 0, 

_ 6fc 

rn r — niir < . 

8 

We write dA 1 = B x U B 2 U B 3 , where 

£?i = {7r € 9Ai : n x = for some x and 7r x ^ ei for all x}, 
£> 2 = {tt G 9j4i : 7r x = ei for some x and 7r x ^ for all x}, 
i? 3 = {7r G dAi : n x — ei for some x and ir y = for some y}. 

For 7r G Bi, we have shown following (|7.4| that 9t7r K > at x where ir x = 0. Therefore dtir 
points toward the interior of A. For it G B2, we have for x where tt x = ei, 

d t ^x = K x {m x - m w ) +ju(l - (2L + 1)^) 

< -^y + M 

< 0, (7.43) 

since < ^£i- Thus dtn also points toward the interior of A. For it G B3, we can 
apply (|7.4[) to sites x where tt x = and (|7.43[) to sites y where ir y = e%, and conclude that 
dtTT also points toward the interior of A. Therefore the set A\ is invariant for the dynamical 
system (|7.6j) . as required. □ 

7.2.4 Small \l\ Existence of Bimodal Stationary Measure 

Results from Chapter |7. 2. 3l show that the set A±, members of which resemble the 5-measure, 
is invariant for the dynamical system (|7.6|) . In this section, we assume that < — 
and show that there is another invariant set 

A 2 = {tt : ir x < e 2 for all x £ {p, -p}, and | log £-\ < e 3 } , (7.44) 

where p = M/2 and L < M < 2L, 

1 . / ^ K P (K P -K P+1 ) Kl\ 
c 2 = -mm I (1 - v)— — ^ — > & >^rl, ( 7 - 45 ) 



2 



(2 



1 K P (K P - K p+1 ) K* 



8(2L + 1) ' 8K% +1 (L -l + iy 16K t (L - I + 1) ' 

C2 

16(2L + 1)K P 



(7.46) 



- imn( 1.^2.1,^(1 + ^) ) . (7.47) 



89 



Apparently, M must be even; this is such that = 1 but -B(p-i)-(-p) = &• Notice 

that since M <2L,l = M- L<M-^- = ^-=p, and also p — ^- < L. Thus I <p< L. 
Members of are bimodal distributions with very sharp peaks at sites p and —p. 

For 7T e A 2 , tt p + tt_ p = 1 - ^^p.pJTj > 1 - (2L - 1) 8(2^+1) b y tne condition 
tt x < f^2L+i) ^ 0r 35 i {P> therefore 7r p + 7r„ p > |. Now since | log ■£ s -\ < log 2, we have 

1 < JLe_ < 2. This means that 

2 — 7T_« — 



min(7Tp,7r_p) > -, 



(7.48) 



for otherwise, say ir p < j, then ir^ p > |, which means that < | < |. 

We use the same idea that we used to establish the invariance property of A\ to 
show that Ai is also an invariant set for the dynamical system, for < % 2 -£2- 



Lemma 7.2.6. For any tt 6 A2, the following estimate holds: 



m x > — for x e [-L, L]\{p, -p}, 



min(m p , m_ p 

where A2 and C2 are defined in {7.44}) and \7.45% respectively. 



(7.49) 



Proof. As in the proof of Proposition 17.2.51 we first establish a few bounds on fitness of 
various sites for n G A2 . For x = p, 



bJ2K z n z 



-p 

b) 

z=-L 



> K p (l - b)K. 



p n —p 



> 



(l-b)Kl 



(7.50) 



where we use ()7.48|) in the last inequality. For x G [p + 1 , L] , 



= K n 



z=-L 



-K x 



x-M 



bj2K Z 7r z + (i-b) K **» 



z = -L 
x-M 



= (K p - K x )bJ2 K z*z + i K v - ~b)J2 K * w * ~ - fo ) H K **" 

z z——L z——p+\ 

Since x e [p + 1, L] and K x is decreasing in [0,L], we have K p — K x > K p — K p+ i > 0. 
We apply these facts, along with (|7.48|) and the requirements on ir x for tt 6 A 2 , to the right 
hand side and obtain 

m p -m x > 0+{K p -K p+1 ){l-b)K- p ^-K p+1 (l-b)(L~l + l)K- p+1 e 2 



(1-6) 



> (1-6) 



if p (g p -X p+1 ) 
4 

K p (K p - K p+1 ) 



K p+1 K p ^{L - I + l)e 2 



(7.51) 
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-p -I 



I p 



Figure 7.3: Illustration of A2 



for xe]p + l,L] since e 2 < s ^l_^i+iy Fot ^[0,1-1], 

m x = 



therefore using (|7.50[) and the inequality above, we have 

m p — m x > 



(1 - b)K 2 v 



since 



A" 



1-6 



< For x e [l,p-l], 



K x 



e-M 



— L 



-/ 



< K t (bKoJ2^ + K o 62 ) 

z z=—L 

< K l (b+(L-l+l)e 2 ), 
therefore using (|7.50[) and the inequality above, we have 



m p — m x > 



(1 - b)K. 



j— £ - lU{h + (L — 1 + l)e 2 ) 

-f-b [^ + Ki \-Ki{L-l + l)e2. 



K 2 K 2 ( K 2 \ K 2 

Since b < 2 k 2 +k 1 k ' we nave ~i~~~ b + ^ J — if'- Furthermore, since e 2 < 



(7.52) 



K 2 



WK,(L-l+l) ■ 



we have 



m p — m x > 



if 2 
Iff 



(7.53) 



The estimates (|7.5ip . (|7.52p . and (|7.53|) compare m p with to x for a; £ [0,L], Similar cal- 
culations comparing m„ p with m x for x € [— L,0] yield similar results. Then recalling the 
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definition of c 2 in (|7.45p . we have 



> c 2 for x e [0,L]\{p}, 



and m^p — m x > c 2 for x S [-L, 0]\{— p}. 
To establish the lemma, it suffices to compare m p and m_ p : 



(7.54) 



\m p - m_ p | 



K„ 



K p {l-b) 



bJ2K z n z + (l-b) K z ir z 

Z =-L 

L 

Z Z —p 

L 

^K z (ir- Z - ir z ) 



K 



z =p 

L 



< K p {l-b)Y,K z \ir- z -n z 



Since K z < K p for z € [p, L], iT^ < -ftT_ p = if p for 2 G [— L, — jj], and tt z < £2 for z 7^ —p,p, 
we have 



K-TO_ p | < i^(l-6)(|7r_p-7rp| + (L-p)2e 2 ) 



(7.55) 



We treat the two terms in the above sum separately. We first deal with the second term 
2K^(L - p)e 2 . Since L - p < 2L + 1 and K p > K%, we have 16 ^l+i)k p - 8(l- p )k* ■ Then 
the definition of £2 in (|7.46p means that £2 < w( 2 l+i)k — s(L-p)K' A > which implies 



2K 2 p (L-p)e 2 < 



f'2 



(7.56) 



For the first term in (17.55[) . we can divide into two cases: tt p > 7r_ p and ir p < 7r_ p . If 
7r„ > 7r_„, then 



< TT p — 7T_ p = TT p — e €3 7T_ p + e £3 n _ p 



e J 7T_,, — 7T_, 



(7T p - e £3 7T_ p ) + (e £3 - l)7r_p. 



The definition of £3 in (|7.47|) implies that ir p < e ea ir_ p and e £3 — 1 < 4^7, so continuing the 
calculation in the line above, we obtain 



1% " TT-pl = ^p - 7T_p < + T^-p < T^2 • 



(7.57) 



If 7Tp < 7r_p, we get the same bound. Therefore applying (|7.56p and (|7.57|) to l|7.55[) . we 
have 



1 , c 2 c 2 c 2 
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This result means that the estimate in (|7.54[) can be generalized to 

min(ra p , to_ p ) - m x > — for x e [-L, L]\{p, -p}, 
as required by the lemma. □ 

Proposition 7.2.7. If /i < then the set A2 defined in ^7-44\ ) * s an invariant set for 

the dynamical system J7.6p . 

Proof. As in the proof of ProDOsition l7.2.5l it suffices to observe that d t Tt x > where n x = 
as shown by the argument following (|7.43|) . and check that the following inequalities hold: 

(d t 7T x )\ 2 ^ A2 < for x ± p, -p, (7.58) 



d t log 

7T_„ 



< 0, (7.59) 

log -^-=C3,ieA2 



and ( d t log ) 



— P 



< 0. (7.60) 

log _T P =£3,ir£A 2 



The estimate (|7.49l) means that for 7r G ^2 and x £ {p, —p}, m x is significantly smaller than 
min(TO p ,m_p). But the mean fitness cannot be much smaller than min(m p , m_ p ): 

min(m p , m_ p ) — = min(m p , m_ p ) — rn x TT x 

X 

< min(m p , m_ p ) — m p ir p — m_ p 7r_ p 

< min(m p , m~ p ) — min(m p , m_ p )(7r p + 7r_ p ) 
= (1 — Tip — 7T_ p ) min(m p , to_ p ) 

< (2L — l)e 2 min(m p , m_ p ) 

by the requirement that n x < ti for x ^ p, —p and tt e A 2 . Since 

m p = if p ^ z B p _ z K z ir z < K p Kq and similarly m_ p < K p Kq, we have 



min(m p ,m_ p ) — m w < (2Z — \)e2K p Kq = (2X — I)e2-K 



We use the definition of e 2 in (|7.46|) to obtain e 2 < i 6 ( 2 l+i)A' < s(2L-\)k ' wmcn , when 
applied to the estimate in the line above, implies that 

min(m p , m_ p ) — m w < (7-61) 
8 

Now ([7T49]) and (fT6T|) imply that for tt e A 2 and x ^ {p, -p}, 

m x - m,r < m t + — - min(m p , m_ p ) < -— + — = — — . 
Thus for 7r € 9v4 2 n {7r 2 = e 2 for some z ^ {p, — p}}, we have for x where ~k x =62, 

{d t n x )\n x=C2 = ^x(m x -m n )+fj,(l-(2L + l)'?r x )\ 7r!e=e2 

, 3c 2 

< - £2 — 

< 0, 
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since /j, < ±f-e 2 . This verifies (|735)) . 

Now we deal with ir S dA 2 fl {log = €3}. By (|7.6|) , we have 



<9 t log — — 



(m p 



fj, /j, 

TOtt H fi(2L + 1)) - (m_p - TO?,- H 

7T„ 7T_ 



>p 

/' I — 

7T„ 



1 



M (2L+1)) 



(7.62) 



If -^2- = e £3 > 1, then 7r„ > 7r_„, which means that — — < 0. Therefore it remains to 



check the sign of m p 



If 



bY,K z 7r z + (l-b) J2 K *** 



— L 



tfp(l-fc) *sTp(7r_p-7rp)+ K *(* 
2 63 , then 




(1-6) 



(7.63) 



7T_ p (l 



7T_ p max(- 1, 



C2 

4if 2 



)< 



C2 



16^ 



(7.64) 



by the definition of €3 in (|7.47[) and the fact 7r_ p > \ established in (|7.48|) . For tt £ A 2 and 

z > p + 1, 7r_ z — 7r z < 2e 2 , therefore 



E 

z=p+l 



K z (ir- Z -7T Z ) < (L-p)K p 2e 2 . 



Applying (|7T64l and ([7T65]) to (f7T63|) . and using the requirement e 2 < 



C2 



16(2L+1)K P 



(7.65) 



KM, 



which implies e 2 < 32 k-z(l-p) > we conclude that m p — m_ 
implies that 



< if = e £3 . Therefore (TE^i 



d t log — — 



<0, 



which verifies (|7.59|) . The verification of (|7. 60[) is similar, and the proof is complete. □ 

Propositions 17.2.51 and [7T2~71 imply that if b is sufficiently small and /i < C36 2 , where 
C3 is a small constant dependent on K and L, then (|7-6[) has at least two stationary distri- 
butions, one resembling a 8- measure and the other bimodal and having little mass in the 
middle. But Theorem 17.2.21 imply that if b is sufficiently small and /x > C46, where C4 is a 
large constant dependent on K and L, then all stationary distributions are bimodal and have 
very little mass in the middle. This phenomenon is illustrated in figure I7T41 We conjecture 
that there is a phase transition between a unique stationary distribution and two stationary 
distributions in the behaviour of (|7.6j) for small \x and b. But since we cannot come up with 
a straightforward comparison argument in either /i or b, this remains a conjecture. 
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b 

Figure 7.4: Conjecture of phase transition in (|7.6p for small /x and b (Shaded region is where 
Theorem E2H Propositions [7X5] and [L2Jl work) 
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Chapter 8 

Stationary Distributions 



In Chapter we examined the large-time behaviour of the deterministic dynamical sys- 
tem (|7.1[) . i.e. lim^oo limAr^oo tt n (t). In this chapter, however, we will take the limit t —> oo 
first and examine the behaviour of v^' N , the stationary distribution of ir N ; more specifically, 
we will do this by taking the limits TV — * oo then // — » Q and examine lim^^o lim-iv— «» v^' N ■ 
For this, we consider the case of symmetric, strictly positive, and unimodal K, with K x 



1 



{\x\>M} 



with L < M < 2L. 



strictly decreasing for x £ [0, L] and K$ = 1, and B x = 
Define I = —L + M, then m x — for x G [—1 + 1,1 — 1]. Define C3 = sup^ra^. For the 
strong selection model with N particles and mutation rate fi described in Chapter 16.2. 1[ 
we observe that this continuous- time finite-state Markov process has the property that all 
states communicate, and therefore it has a unique stationary distribution 1/^' [Durrett 
1991]. Let (G N ,T>(Q N )) denote the generator associated with this Markov process, then for 
all F € C°°{V{E)) C V{g N ), we have 



G N F{tt)^- n \dn) = 0. 



(8.1) 



Let (G,T>(g)) denote the generator associated with the deterministic process de- 
scribed by the ODE I^TD|) : 

d t TTx = Tr x (m(x, 7r) - m^) + fj,(l - (2L + 1)tt x ). 



We calculate the effect of the generator Q N on a C°°-function 



F(ir 



x yjkx 



F[n N - — + — )- F(n N ) 
1 N N ' v ' 



^EE 

X y^x 



F 7T 



I 



-N 



N' 



F(tt 
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Performing a Taylor expansion on F, we continue the above computation: 

g N F(7r N ) 



^EE 

X y^x 

EE 

X y^x 



Ndir" 



{7r } ~^F (7r } 



9< 



Trf (m(p>; + M ) + ft^F)^). (8.2) 



Since sup KiJ/i7r n£(m(y, tt n ) + n)0{N- 2 ) = 0(N- 2 ), we have 



R x {N,F)(-k) < 



C\\F\\ 
N 1 



where we suppress the dependence on L. Therefore R\ ( N, F) (it) is 0(N 1 ) uniformly for 
all 7r € V(E). Here we use the norm 



\F\\ = 



J (i+ E tf) iW# 



associated with the Sobolev space H 2 (M. E ) for F, where F denotes the Fourier transform of 
F. The first term in (|8.2p is in fact equal to QF(tt n ) by the following computation: 



EE 

X y^x 



OF dF 



C*7T 



EE 



^ (7r )_^ (vr ) 

9% d-K x 



TT x {m(y,n)iTy + fi) 

Tr x (m(y,ir)ir y + fj,) 



Qp Qp 



a; j/ 



= Q^-{ 7T ){ m (^, 7T ) 7T x + A 1 ) - ("V + (2L + l)/i)y^7T x ^— (7r) 

a; K a: 1 
^ Qp 

= ^[7r x (™(a;,7r) -m,) +A*(! ~ (2i + 1)^)]^— (-zr) 

= 0F(tt). 

Therefore (|8.2p can be written in the following much-simplified form 
Q N F(n N ) — GF(n N ) + Ri(N, F)(ir N ). 

Then (f5TT|) implies 



J gF{ir)^ N {dn) 



R 1 {N,F){tt) V ^ n ((1tt) 



< 



C\\F\\ 
N 



C\\F\\ 
N ' 



QF(-k)v»> n {dn) =0(N- 1 ). 



(8.3) 
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Since E is compact, so is V{E) and ^(^(-E)), therefore for each /x, we can take a sequence 
jV fc (//) such that V ^ N ^ converges weakly to some G T , (T'{E)). By (|8.3[) . satisfies: 
for all F G C°°(V{E)), 



0F(7r)^(d7r) 



^[7r x (m(a;,7r) -m,) - (2i + 1)^)] — (tt)^^) 



= 0. 



(8.4) 



Therefore i/ M is an stationary distribution for the deterministic flow Q. Now we take a 
sequence fik — ► 0, such that z/ Mfc converges weakly to some i/° G V(P(E)), and by (|8.4p and 
the following estimate: as fj, — ► 0, 



| £ M (1-(2L+1K)^(ttK(^) 



<HC(F,L) 



= /iC(F,L)-0, 



v° satisfies: 



^7T x (m(a;,7r) - m ir)g^~ {K)v a {dir) = 0. 



■5) 



After establishing several lemmas, we will use the above characterization of v^ 1 and v to 
prove the following: 

Theorem 8.0.8. Suppose K is symmetric, unimodal, and strictly decreasing for x G [0, L], 
with Kq = 1, and B x — l{|a;|>M} with L < M < 2L. Define I = —L + M. If is a weak 
limit point of v^- N , then for any z G [— I + 1,1 — 1], we have 

u^{ir : tt z > <5} < - 



5f +M(2£+1)' 
where S 2 = mm ^2(2l+i) ' 4(2L+i)K? > 2(2L+i)^ ) ■ 

Corollary 8.0.9. Under the same assumption on K and B as in Theorem \8.0.8[ we have 

v°{ir : tt x = Vx G [-1 + 1, 1 - 1}} = 1 
iff is a weak limit point of i/ M , and consequently, for any 6 > 0, 

u^^iit :tt x < S Vx G [-1 + 1,1- 1]} > 1 -S 
for some sufficiently large i and j = ji . 

Proof of Corollary 18.0.91 We recall that if a sequence of random variables X n X^, 
then liminfn^oo P(X n G A) > P(X oa G A) for any open set A. Thus Theorem 18 . 0.81 implies 
that for any z G [—1 + 1,1—1] and sequence m such that 



/ Q {n : 7r z > 5'} < liminf v^^tt : tt z > 5'} < liminf — 



IM 



5' f + W (2L+1) 



= 0. 
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This holds for any positive 5', so v°{n : tt z > 0} = and therefore, 
i>°{n : n x = Wx € [-1 + 1, 1 - 1]} = 1. 

□ 

Lemma 8.0.10. If fi < 1, then 



{ 7T : ir x > Vz G \-L, L] } = 1. 

\ 4(2L + 2) L ' J J 

Proof. Let z G [— £, L] be an arbitrary site. Let 8 < 2 (2L+2) ^ e a sma U positive constant 
and / G C°°(M) be a function that satisfies the following requirements: 

(a) f'{x) = 1 for x < §; 

(b) /'(x) = for x > 8; 

(c) f'{x) G [0,1] for x G [§,£]; and 

(d) /(l) = 0. 

Define ^(tt-l, . . . ,tt l ) = /(tt z ). Then (JHUJl implies: 

7r z (m z - SiJ + m(1 - (2L + l)w z )]ffa)v»(dw) = 0. (8.6) 

Since m z > and m n is bounded above by 1 uniformly in w, we have m z — > — 1. 
The integrand in the above integral is nonzero only for ir z G [0,(5], therefore (|8.6p can be 
rewritten as: 

/ [7r z (m z -m 7r )+M(l-(2L+l) 7 r 2 )]/'(7r 2 K(d7r) = 0. (8.7) 

J{7r:7T s <5} 

Furthermore, the integrand is bounded below: 

n z (m z -m ff ) + /i(l - (2L + 1)tt z ) > -tt 2 + - (2L + 1)tt z ) 

= -(l + //(2i + l))7r z +/i. (8.8) 

Since < 1, we have (5 < 2 (i+2L+i) < 2(i+^(2L+i)) > an d therefore if 7r z G [0,8] C 
[0, 2(1+M f 2L+1)) h then (1 + /i(2L + 1))tt z < § . Thus (^J implies that 

7r z (m z -rn 7r )+/i(l-(2L + l)7r z ) > |. (8.9) 
Applying the estimate (|8 . 9|) to l|8.7[) . we obtain 

a 

2 
2 



> J / /'(TT.KfdTT) 



{7r:7r 2 <(5} 

r 

{7r:7r i <(5/2} 



> J/ /'(7r z )^(d7r) since /' (x) > for a; G [5/2, 8} 



2 



^(rfTr) since /'(.x) = 1 for x G [0, <5/2] 

{7r:7r*<*/2} 



<! 7T : 7T 2 < — 

2 I 2 
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Therefore 



which implies that 



{ 7T : 7T Z < - \ = 



: tt x < ^ for some x G £]| < ^ ^ M : 7^ < || = 



Thus the lemma follows. 

Lemma \8 . . 1 01 implies that ^ M -a.s., 

Win > tt-lK-lKlttl > 



16(2L + 2) 2 



□ 



(8.10) 



Recall that 5 2 = min ( a(i £+i) ' 



A'7 



2 r.- v 

16(2L+2) 2 



and 



A = {-k < S 2 }, 
and the function V : 7>(-E0 R, 

= ^("^ - rn^) 2 + /u(2i + 1) 2^ 



)Kf' 2(2L+i)' 1 J 1S independent of /i. We define Ji = 

(8.11) 



1 



2L + 1 



.12) 



We recall from a formula following (|7.7p that = 2m K , and observe that 
0m7r = y^jTr x (m x - m n ) + n{\ - (2L + l)7r x )]^— ^ 
= y^[7Tg(w x - m OT ) + - (2£ + l)7r a )]2m x 



= 2 ir x m x (m x - m n ) + /i(2X + 1) m x ( ^ ^ - 7r x 
Adding —2 ^ x ^1^^ — m^) = to the right hand side, we continue as follows: 



5m,r = 2 



= 2 



y^^ x m x (m x - m^) - ^V^m^m^ - rn n ) 

X X 

+ K 2L + !) m2; ( 2X + 1 ~ ^ 

a; ^ 



Therefore 



£77^ = 2t/>(7r). 



.13) 
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We will establish in the following two lemmas that on the set A defined in (|8.11[) . - 0( 7r ) is 
bounded below by ^ 2 l+i) We write A as a disjoint union of two sets A\ and A 2 , where 

A\ = 1 7r e A : tt x < — for all x with m x 7^ 1 , 

\ 2L+1 xr /' 

A 2 = 1 7r G A : 7r T > for some a; with m r 7^ 1 , 

1 2L + 1 ^ /' 

and prove a lemma for each case. 

Lemma 8.0.11. For any n € j4i, we /iawe VK 71 ") > 2L 1 +1 'm^. 
Proof. For any 7T G Ai, there are two cases: 
Case 1 tt x < 2£ 1 +1 for all a; € [— L, — Z] U [I, L}; 

Case 2 7r x > 2L 1 +1 and m x — for some (possibly more than one) x € [— L, — I] U [/, L], but 
for any x with 7^ 0, is still < 9L 1 +1 . 

For Case 1, (gHU) implies 

Since = for x E [—1 + 1,1 — 1] and ir x < 2 l+i f° r a ^ x & ']> the secon d sum on the 
right hand side is nonnegative, and therefore 

1-1 Z-i 
^(tt) > X! "'"K ~ = X! 7r ^ 7 "^' ( 8 - 14 ) 

x=-2 + l x=-i+l 

Using 

i-i -i i o; 1 

E X — > ^ — "v Zt — 1 

7r a = 1 - 2^ - 2^ ^ - ST+i' 

ce=— i+1 x— — L x—l 

(gHH) implies that 

^ * 2TTT< * 2LTi<" (8 ' 15) 

We now turn to Case 2. Suppose y is a site with ir y > 2L 1 +1 and m y — 0. We bound 
iP(tt) in (|8.12[) a bit differently from Case 1. Since ir x < 2 l+\ f° r an y x with m x 7^ 0, the 
second sum in (|8.12p is nonnegative, and therefore 

X 

Therefore (f8TT5]) and (flTTo]) imply the le mma. □ 
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Lemma 8.0.12. If ji < ^l+i^k' 1 ' ^ en f or an y n € ^2, we have ip(ir) > 4 ( 2 '2+i 



Proof. For tt £ A 2 , the main inconvenience is that the second term in (|8.12p . i.e. the term 
involving 9TX1 — fi, can be negative. We divide into three cases and show that in each 



2L+1 

case, we have 



V>(tt) > 



4(2L + 1) 



Case 1. For all x G [— £, — Z], 7r x < 2L 1 +1 , and i G [Z,L] is the rightmost site with it. > 9L 1 +1 , 
i.e. there is no x to the right of i with ir x > 2 l+i > 

Case 2. For all a; € [I, L], n x < 2 l+i 1 an< ^ * [ — L, —I] is the leftmost site with n. > 2L 1 +1 , i.e. 
there is no x to the left of i with tt x > 2£ 1 +1 ; 

Case 3. i% G \—L, —I] is the leftmost site with it. > 2L 1 +1 , and i 2 S [Z, L] is the rightmost site 

With 7T. > 



2L+1 ' 



> 



2L + 1 



< 



2L + 1 



i-M 



L 



Figure 8.1: Illustration of Case 1 



We first deal with Case 1. The terms in the first sum of the definition of ip in (|8.12|) 
are squares and therefore nonnegative, hence we can throw some of them away and obtain 
the following: 



i-M 



x— — L 



) 2 + M (2L + l) ]T 

x — — L 

+7Ti(m, - TOtt) 2 . 

For x € [— L,i — M], B x -i = 1, therefore 



1 



2L+ 1 



.17) 



= K x J2 B x-zK z ^ z > Kim 



> 



z=x+M 



2L + 1 
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Applying the above and the requirement < 2 (2L+i) ^ or ^ ^ A to (|8.17[) . we obtain: 



i-M i-M , r2 r2 



x——L x——L 

K 



2L + 1 2(2L + 1), 

i-M 

4(2L + TF E (8-18) 

x=—L 

This deals with the first term in (|8.17[) . For the second term in (|8.17[) . we observe that 
m x — for x G [— i, i], and 2L 1 +1 — 7r x > for x £ [-L, —I] U [i + 1, L], therefore 

L 



Em„ ( 7r T | > m T | 7r 
\2L + 1 x ~ ^ x \ 2L + 1 



x——L x—l 

Applying the universal bound 2L 1 +1 — 7r x > — 1 to right hand side above, we obtain 

L 



E mx \ 2L + 1 ~ nx ) - ~E TOa; - ( 8 ' 19 ) 



-L v 7 z=i 

Now applying ()8.18j) . (18.19|) . and the requirement iii > 2 l+i t° (|8.17[) . we obtain 

> A{2 L + ir ^ ^-M(2£ + l)^mx + ^K-m») 2 . (8.20) 

— L a:—/ 

We observe that 

i i x— M i x—M i i—M 

x—l x—l y — — L x=l y— — L x—l y— — L 

i-M 

< K?(L-l + l) ^ % X <K?(2L+1)%([-L,i-M]). (8.21) 

x— L 

Therefore the computation in (I8.2Q|) can be continued as follows: 

V>M > ^ 1)2 tt([-L, z - M]) - nK?{2L + l)M\-L, i - M]) 

+ 2LTT (m ^^ )2 

K 4 1 

> ^—^tt(\-LA-M]) + (mi-mJ 2 (8.22) 

~ 8(2i + l) 2 U ' J; 2L+ 1 V ; V y 

a: 4 

since /i < g^L+i) 4 ^^ - The r ight hand side of the above is a sum of two nonnegative terms, 
both of which cannot be small at the same time. Indeed, notice that 

i-M 



= K,J2 K *v* < Kfr(l-L, i - M\). 



x=-L 
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Therefore, if n([— L,i ~ M\) < -^^m^, then rrii < ^m^, and we have (m, — m^) 2 > jin 2 ,, 

1 2 

hence the second term in (|8.22|) alone implies > 4 (2L+i) • Otherwise, 7r([— L, i — M]) > 

2^5- mjr, then the first term in (|8.22|) implies i/'( 7r ) > i6(2£+IpK^ • ^° we nave the following 
estimate on 



> min 



yiQ^L+iyK^ 4(2£+l) 



(8.23) 



Since <5 2 < 4(2L ^ )A ,a , on the set A = {7r : < <5 2 }, we have ^^L+iyK? - Mjtfi 
therefore (|8.23|) implies 



4(2L + 1)' 



> 



2L + 1 



< 



2L + 1 



-L i 2 -M A -/ 



'2 



A L 



h + M 



Figure 8.2: Illustration of Case 1 

Case 2 follows by exactly the same argument. For Case 3, we first observe that if 
* 2 — ii > M then Sj 2 _jj = 1 and on A — {n : < <5 2 }, where by definition 5 2 < 2T2T+1T 2 ' 
we have (5 2 > > Ki 1 Ki 1 Ki 2 iTi 2 > 72T+TP ' wn i cn is impossible. Thus i 2 — i\ < M, and 



(2L+Tp 

i 2 -M L 



X — — L x—ii+M 

L 



-ir^im^ - m^f + fi(2L + 1) ^ 



x— — L 



1 



2L + 1 



.24) 



We can use techniques similar to those used for Case 1 (leading to (|8.18J) and Q8.21[l ) to 
obtain the following bounds: 



i 2 -M 



^ n x (m x - rn^f + ^ n x (m x - m^f 



x=ii+M 



> 



K 



4(2L 



z 2 - M]) + 7r([ii + M, L])) 
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and 

X>* Girl - ^ X>* Gztt - **) + £ 



12 — i 



2L + 1 



> -^rn x - ^ m x 

a?=Z ii 

> -/^ 2 (2i+l)7r([-L,i 2 -M]U [*i + M,i]). 
Since /x < g^L+i) 4 ^ 5 ^ e a ^ ove tw0 estimates applied to |8.24[) implies that 

- 4(2£ + 1)^ ([ ~ L ' * 2 ~ M] U [tl + M ' L]) 

-/z(2Z + l)K?(2L + !>([-£, i 2 - M] U [i x + M, L]) 

K 4 1 

+ 8(2lW* ([<1 + M ' L]) + 2lTT (m - -^ )2 - 

We can now apply the technique leading to (|8.23|) to the sum of the first two terms above, 
and then to the sum of the last two terms, to obtain: 



V>(tt) > 2 min 



ml 



\16(2L + 1) 2 K?' A(2L + 1) ) ' 



which by virtue of < 62 < ^L+VjK' 1 011 implies 



2(2L + 1) 

Thus we have established the necessary bound on ij>(ir) for w £ A2 in all three cases. □ 



Lemma 8.0.13. If 8 1 = 16 ^ 2)a and « 2 = mh^^^y, 4(2L +W , ^^fey 



then 



v»L:m v < || =0. 
Proof. Recall from (|8TTU|) that i/"-a.s., 

> (8.25) 
Let / € C°°(R) to be a function that satisfies the following requirements: 
(a) f'(x) increases from to for x £ [0, <5i]; 
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(b) f{x) = £ for 8 1 <x<%; 

(c) f{x) decreases from ^ to for x £ [4f , 

(d) /'(as) =0fora;> ^. 

Define F(tt^ l , ...,tt l ) = f(m„). Then JE3| implies: 



Substituting (]8 . 13[) into the above equation, we obtain 

f{rn^(n)^{dTT) = 0. (8.26) 



Lemmas 18.0.111 and 18.0.121 imply that tp(n) is bounded below by jr^L+Tj w ^ 011 ^ ne se ^ 
A = {tt : m-n < 62} , defined in (|8.11|) . Applying this fact and (|8.25|) to (|8.26D . we obtain 

= 2/ %1)(tt) f' (jn^v^ 1 (dn) since f'{x) only nonzero for x in [0, ^f 3 -] 



{7r:0<m„<5|3-} 

2/ VW/'(^7r)^(d7r) since ^{tt : < <5i} = by (H301) 

J{^:5i<rn x <^2-} 



> 2 / VM/'(m^K(*0 since ^(tt)/'^) > if m T £ ^] 

J{7r:5i<rn x <^-} 

> 2 / — - — 7=9-^(^7'") by the bound on ip(m„) for 7r e A 

^{Tria^rn^^} 4(2L + 1) m; 



2(2L + 1) J {^s-lKTu^^-} 
Therefore 

^ ^7T : m,r < y| = 1^ {tt : < m v < 5x} + Itt : 8 X < m w < y j = + = 0, 
as required. □ 

Proof of Theorem 18.0.81 For an arbitrary site z £ [—1 + 1, 1 — 1], we have m z — 0. If 
we take F(ir) = tt z , then by (|8.4j) . we have 

7r z (m(z, tt) - ttTtt) + /u(l - (2L + l)7r z )^(rf7r) 
^-{m^+ /j,(2L + l))7r z ^(d7r), 



(m T + /i(2i + I))7r z z/ M ((i7r) 

(m 7r + /i(2L + l))7T 2 ^(d7r) 

{7r:riT x >(52/2} 

(m^ + ^i(2L + l))7r 2 i/'*(d7r), 

{•n-:m„<<5 2 /2} 
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where 62 is as defined in Lemma 18.0.131 The same lemma shows that {n : m n < £2/2} has 
i/^-measure 0, so the second integral in the above equation is 0, thus 

H= f {rn^ + fi(2L + l))Tr z ^(dn)> [ ( ^ + fi(2L + 1) ] Tr^drr), 

i.e. 

The observation ^{tt : ir z > 6} < i J n z v^(dii) completes the proof. □ 
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Appendix A 

A Result on the Conditioned 
Dieckmann-Doebeli Model 



In this section, we deal with a special case of the conditioned Dicckmann-Docbeli model 
described in (|6.2p and (|6.3|) . and show that in this special case, there exist symmetric 
bimodal stationary distributions. Let E = [-L, L] n Z, A be a Markov transition matrix 
associated with mutation, n(t) € V(E) for all t G Z + , and M be an even constant such 
that L — 1 < M < 2(L — 1), then the equation of the discrete-time dynamical system is as 
follows: 

*,(* + !) = 5>(y^ * Wt)) 



where F x (tt) = V; (2) (7r) = 



Ez C X — Z 1T Z 
K x = l{|x|<L-X}) 

and C x = 1{| x |<m}- (A- 1 ) 
Every step of (|A.1[) can be divided into three sub-steps: 

Resampling ^{t) = n x (t)V x (ir(t)); 

Mutation n'^t) = y ^A{y,x)-K x (t); 

v 

ii /+\ 

Normalization n x (t + 1) = = - , . (A. 2) 

Notice that performing the normalization step before the mutation step does not change the 
model, but for this section, we will use the step order in (|A.2|) . 

If if, C, and A(y, x) — A(y — x) are symmetric about 0, then the map Tr(t) i— > 7r(t+l) 
maps the set of symmetric probability measures on [-L, L] to itself. Therefore by Brouwer's 
fixed point theorem, there exists symmetric stationary distribution(s). We first derive a few 
simple facts about symmetric stationary distributions in the no-mutation case, i.e. when 
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Figure A.l: Relative locations of various sites of interest 



A = I. In this case, if v° is a stationary distribution of (jA.lj) . we must have 



if vl ^0, then V x (v°) 



l~,z=x-M y z 



is a constant. 



(A.3) 



This is the same condition as (|6.5p . Since A^ = outside the interval [-L + 1, L—1),V X =0 
outside that interval, i.e. at x = ±L, therefore the support of any stationary distribution 
v° must lie in [-L + 1,L — 1]. We restrict our attention to sites in [-L + 1,L — 1]. If 
the competition intensity function C is rectangular, as in (|A.1[) . two sites either compete at 
intensity 1 or they do not compete against each other at all. If M = L — 1, then is the 
only site that competes with every other site in \—L + 1, L — 1]; but if M — 2(L — 1) — 1, 
then for any site x G [-L + 2, L — 2], x competes with every site in [-L + 1, L — 1]. Define 
/ = — (L— 1 — M), then [—2,2] contains the sites that compete with every site in [— L+1,L— 1], 
therefore 



A' 



1 x €[-/,/] 



V x+M , , iv° I x € (oo, -L] U \L, oo ) 



(A.4) 



We also observe that f- > 2 because M - 21 = M + 2(L-1- M) = 2{L - 1) - M > 0. If 
there is some mass in [— L + 1, — 4p — 1] U [4|- + 1, Z — 1], then since no site in [4£ + 1,L— 1] 
competes with any site in \—L + 1, — 4p — 1], we have 



Z=E-M 



for x e [-£ + 1, -4f — 1] U [-y + 1, L — 1], which by the definition of V x (tt) in (|A~l"j) implies 
the following: 



V x (v°) > 1 for x € 



M 

-L + l,- T -l 



u 



M 



(A.5) 



Combining the results on fitness V x (w) in (|A.4[) and (|A.5[) and condition (|A.3[) . we conclude 
that stationary distributions with rectangular K and C as defined in (|A.1[) must have all 
the mass falling in either [-L + 1, -M - l] u [f- + 1, L - l] or [-f-, f-] . 

Now we turn to the case with small mutations. We take to be an operator that 
corresponds to a small 1-step mutation, i.e. convolution with /i<5-i + (1 — 2p)5o + /i5i, and 
define to be a stationary distribution of (jA.lj) with mutation kernel A^ . Any stationary 
distribution of (|A. 1|) satisfies the following condition: 



Vx e [-L, L], ,<^14-i + (1 - 2i£)v%V x + ^ +1 V X+1 = Vug, 



(A.6) 
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where V y = V y {y^) and 



V = V(v»)= < V ^) 



is the normalization constant. Condition (|A.6|) implies that is nowhere zero in [—£,£]; 
otherwise, say v% = 0, then then u^_ 1 = v% +1 = as well, which by induction means that 
v£ = for all x, a clear contradiction. Since the support of has expanded on both sides 
each by 1 site compared to v°, the sites where V.{v^) is constant 1 should correspondingly 
contract by 1 site on each side: 

r l xe[-l + i,I-i] 
V,(y») = = I >1 xe[-L+l,-l]u[l,L-l] . (A.7) 



i£ (oo, —L] U [L, oo) 



L-iz=x-M 

Therefore, for sites in the middle, i.e. x E [—1 + 2, 1 — 2], ^ M satisfies: 

+ (1 - 2 M K + = F(^K. 

Since every site x in [— L + 1, L — 1] competes with all sites lying on the same side 
(with respect to the origin) as x, and the stationary distributions we consider are symmetric, 
Y^ztx-M v x i s at least | for a; G [— L + 1,L — 1], therefore 

V x {v») <2fora:e [-L + 1,L-1], (A.8) 

i.e. the normalization constant V(v^) is bounded above by 2. We also need a nontrivial 
lower bound of V(y^) for symmetric that is uniform for small /i for the proof of the 
upcoming theorem. We rewrite condition (|A.6|) for x = L and x = L — 1, taking into 
account the fact Vl(^) — from (|A.7|) : 



L-1- 



K-2^-2K) + (1-2m)^-iVl-iK) = V> M K 



Dividing both sides of the above two equations, we get 



-L-1 K-2^-2(^) + (1 - 2 M )^_ i y i _ 1 (^) 

K-l^L-lK) /i 



< 



(l-2/i)^_ 1 VL-i(^) 1-2//' 



-L 

that v>£_i = v^ L+1 <\-Vl<\, hence Ji L > 1, a contradiction. Since V x (v^) > 1 for 



which is < 1 if [i < \. This means that = v^_ L < \, for otherwise, i/£ = i/^ > -j implies 

hat = i^ i+x < | - v£ < |, he: 
x 6 [-L + 1,L - 1] by (|A.7|) . we have 

L-1 



L-1 L-1 



z=-L+l z=-L+l 
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The fact Vl = V -L — 3 then implies 

V{y»)>l-vt L -vl>\ (A.9) 

In particular, V(v^) is bounded between i and 2. We take a sequence fi n — ► 0, such that 
v n = 2/^™ converges to some then since V{v) is a continuous function of v, V n — V(v n ) 
also converges to a positive constant V . We will prove the following: 

Theorem A. 0.14. If v n = v^ n is a convergent sequence of symmetric stationary distribu- 
tions for the conditioned Dieckmann-Doebeli model in iA.l\) , then v? M M — > asn->oo. 



Proof. Since v n — > i/° and V{v n ) — ► Vasrn oo, condition (|A.6[) converges to the following: 

Vx G [—If, Ir], ^X(^°)=^ 

Therefore ^° is in fact an stationary distribution for the no-mutation case, i.e. (|A.1[) with 
A = I. If some mass of v° lies in [-1 + 1, -4f - l] U [4f + 1, 1 - l] , then i/P_ M m must 

[ 2 ' 2 J 

be zero, which means that is? M M i — ► when n — ► oo as required. Therefore it suffices to 
show that ^_ l+1j _m_ i]u[ m +1jL _ i] > 0. 

We assume, toward a contradiction, that v? , = i/P., , = 0. Then 

[-L + 1.-M_i] 

for any positive 6, we have 2/f M ■, = ^p M < <5 for sufficiently large n. We 

[~- ^+ 1 -~ ~~ 1 J [ _ 2~ + 1 '- L ~ 1J 

first derive more refined (than (|A.9[) and (|A.8|) respectively) lower and upper bounds for 
V n - Because of the supposition v? M •, < <5, (|A.6[) and (|A.7p with x — L and the bound 

I -^- + l,-L— II 

< <5 imply that 

vl < S J^1^1 < ^ (A.10) 
by (|A.8p . Applying the estimate f|A.9|) to the right hand side, we have 

v n < ?pL<4S fJln . (A.ll) 

* n 

Therefore 

V"n = E "TOO 
L-l 



H v * V *( vn ) since y_ L (^ 1 ) = I4(^ n ) = by JAJ| 

z=-L+l 
L-l 

> ^2 since V x {v n ) > 1 for x G [-1 + 1, 1 - 1] 

z=-L+l 

> l-8^ n , (A.12) 
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by (KM- 

Forx G [-M f], 

^K) = + m X ~ < -w 1 = < 1 x (A.13) 

2— 2 L J 

by (|A. and the supposition 2/" M ■, < 5. Therefore 



-L 



x=^ + l 



tJ- r+4(5 (A.14) 

" l-2(<5 + 4<5 Atn ) V j 



using (j A. 13|) for the first sum and (|A.8|) for the second. 
Let r = m =^ +1 ,m], then 



> t~ ■ (A.15) 
1 — r 

Condition (|A.6|1 applied with x = L — 1 implies 

(1 - 2n n )vl_ 1 V L -i{v n ) < V(v n )v2-i, 

hence 

V(v n ) 
1 - 2/i n 

Then (|A.14j) and (|A.15j) imply 

1 <-^-r— ^_- + 4 d : 



l-r - l-2/i n Vl-2(5 + 4^„) 
i.e. 

1 _ r > 1 - 2/in = (1 -2//„)(l -2J- 8g//„) > l-3g = 1 _ 75 



4(5 1 + 4(5 - 8(5 2 - 32(5Vn ~ 1 + 4(5 1 + 4(5 



l-2((5+4(5/i„) 

for sufficiently small fj, n . Therefore 

The above inequality and the supposition vl l M ■, = i/p„ < (5 imply that 

— -1J [-5- + 1.L-1J 



f- i+ i,-*-i] = "p+i.i-i] < 8*. (A.16) 
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Let p — . We bound Vi(v n ). Since site —L is the only site in the support of v n 
that does not compete with site I, we have 

1 < V^n) = = _JSi_ < _J__ (A . 17) 

by (|A.11[) . We will establish the following lemma after the proof of the present theorem: 

Lemma A. 0.15. Let v n = be as in Theorem \ A.0. 14\ and suppose i'l^i = 
v [l+i < ^5, then 

1. and vf is bounded away from as n — > oo; 
2- v?_\ < v? + 8 for sufficiently large n. 

Since V[_ L+1 < S<5 by (|A.16|) . we have from (lA.llj) . 

because sites — L and — L + 1 are the only sites in [-L, L] that do not compete with site 
I + 1. We estimate £ v l)VyV v {v n ) - uf 



y 

= Mn^-iVJ-rK) + (1 - 2 M „)if V^ n ) + AWj+iVMO - "f 

- + + l-45//„ + l-8J-4* / i n - p ' 

using i/j^ < p + 5, Vi-x{v n ) = 1, (|A.17|) . i/^ x < (5, and (|A.18|) . Simplifying the right hand 
side of the above, we get 

A^lSp + p- fi n p fj, n S 



1 - 45//„ 1 - 85 - 4<5//„ 

-4/4 <j p _ ^ nP + 4// n< 5p /t n <5 



//n<5 



1 - 4(5/t„ ^ 1-8(5-45// 

/z„p(l -45) //„(5 

< : TT (-Mn<5 ' 



1 - 4<5//„ 1 - 8(5 - 4(5//„ 

< 

2 

for sufficiently small 8 and //„. This estimate means that after resampling and mutation 
(and before normalization), i/™ decreases by at least fi n p/2. On the other hand, uj\_ 1 can 
only increase: let q — then q < S and 

y 

= ^Vl^) + (1 - 2 AlB )l/P +1 V r ,+l(l/ n ) + Mn^WO - ^ + 1 

> /t n p + (1 - 2/j, n )q - q, 
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since Vi{v n ) > 1 and Vi+i(u n ) > 1. Therefore 

A(y, I + \y^V y (v n ) - v? +x > H n (p - 2q) > fi n (p - 26) > 

v 

if 5 is small enough. After normalization, i.e. dividing by V(v n ), and vf+i cannot possibly 
return to their original values. This contradicts the assumption that v n is an stationary 
distribution for (|A.1[) with mutation kernel A^ n , and the proof is complete. □ 

Proof of Lemma IA.0.151 Define 

C x =KV x {v n ). (A.19) 

Notice that Q x depends on n, but notationally we suppress this dependence. For x G 
[-1 + 1,1- 1], ( x = vl since V x {v n ) = 1 from (|A.7j) . For x G [-1 + 1,1- 1], we rewrite 
condition (|A.6|) as follows: 

MnCx-1 + (1 - 2/i„ - V n )( x + = 0. (A.20) 

This is a recurrence relation with general solution ( x = A(3f + Bf3% , where (3\ and Pi are the 
two roots of the quadratic polynomial fi n + (1 — 2/j, n ~ V n )r + /J. n r 2 ; or £ x = (A + Bx)Pf, 
where Pi is the double root of the polynomial. Elementary calculation shows that for the 
solution Q x — (A + Bx)Pf to satisfy the symmetry requirement for L > 1, either B = or 
0i = 0; 0i = leads to the solution of ( x = 0, and B — leads to the conclusion Pi = 1 
and ( x = A; both these two scenarios will be included in Case 2 below. For the solution 
Cx — Aj3f + Bf3%, simple calculation leads to: 

Pi, Ik = 7T~ ( 2/i„ + V n - 1 ± J(V n - l) 2 + 4fl n (V n - 1) 

We divide into three cases: 

1. If Pi and Pi are two real roots, then since C, is symmetric, we must have Pi = I/P2 
with Pi > 0, and the solution is ( x = A(pf + /3j~ x ) for x G [-/, I]. 

2. If Pi — P2, then the solution is ( x = A for x G [—1, 1}. 

3. If Pi and Pi are complex roots, then we write Pi = ^e %9 and Pi — je~ 10 , and the 
solution is ( x = A-fcos(x9) for x G [—/,/]■ Define 

a n =F„-l, (A.21) 

then for Pi and Pi to be complex, a 2 +4/i„a„ = a„(«„ + 4/i„) < 0, which means that 

either a n < and a n + 4^„ > (A.22) 
or a n > and a n + 4^„ < 0. (A. 23) 
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Now (|A.23|) is clearly impossible since /J, n > 0, and (IA.22[> implies that a„ < 0. 
Furthermore, 



tan 6 = 



1/2 



(a„ + 2/i„) 2 V _|_ 4^ nQ!r 

4 ^ x ^ 2 



a 2 + 4/i„a r , 



v -1/2 
1 \ 



(A.24) 



I 1 I ( On \2 I 

Hence (IA.12|) and (|A.2ip imply that a n > —88fi n , and since a n < 0,we have 

< -a n /fj, n < 8(5. 

We conclude from (|A.24[) that for sufficiently small 8, tan 9 is also very small, 

Note that —I and I are the boundary sites for (|A.20[) , therefore statements about 
Q x in the three cases above all hold for x G [— Z,Z], even though (|A.20[) holds for only 

xe [-1 + 1,1-1]. 

In case 1, £ is a linear combination of two convex functions, therefore C is convex 
for x G [—1, I]. In case 2, £ is flat for x G [—1, 1]. And in case 3, £ is concave for x G [— Z, Z], 
but since is small for small 5, it is close to being flat for small 8. Therefore recalling the 
definition of ( x in (|A.19[) and using (I A. 17|) . we have 

(l-4^ n )Ci < v? < Q. 

In summary, for x G [— Z, Z], £ is convex, or flat, or nearly flat for small enough 8] v™ = Cx 
for x G [— I + 1, Z — 1] and vf — is smaller than but very close to Q = furthermore, 
by (|A.11|) and (|A.16|) . we have 

*f M] = 1 - ^ - *f- i+ l,-*-l] - ^+1,L-1] - f£ > 1 - 8<W - 165, 

i.e. v n has almost all its mass on [— Z, Z]. We can then use the symmetry assumption on u n 
to arrive at the conclusion of the lemma. □ 
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